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^ (54) Tide: SECRETED PROTEINS AND USES THEREOF 

(57) Abstract: The invention provides isolated nucleic acid molecules, designated TANGO 253, which encode proteins contain- 
00 ing Clq domains and which are homologous to a human adipocyte complement-mediated protein precursor, TANGO 257, which 
O encode proteins homologous to the human extracellular molecule olfactomedin, a molecule important in the maintenance, growth 
rz and differentiation of cbemosensory alia of olfactory neurons, INTERCEPT 258, which encode Ig domain-containing proteins that 
^ exhibit homology to an antigen (A33) expressed in colonic and small bowel epithelium, and TANGO 281, which encode proteins 

downregulated in megakaryocytes that rail to express the gata-1 transcription factor (a factor critical for blood cell formation) and 
5 can, therefore, represent direct or indirect gata-1 targets. The invention also provides antisense nucleic acid molecules, expression 

vectors containing the nucleic acid molecules of the invention, host cells into which the expression vectors have been introduced, and 
O non-human transgenic animals in which a nucleic acid molecule of the invention has been introduced or disrupted. The invention still 

further provides isolated polypeptides, fusin polypeptides, antigenic peptides and antibodies. Diagnostic, screening and therapeutic 
^ methods utilizing compositions of the invention are aJso provided. 
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SECRETED PROTEINS AND USES 
THEREOF 1 



This application is a continuation-in-part of U.S. patent application Serial No. 
5 09/336,536, filed June 1 8, 2000, the contents of which are incorporated herein by 
reference in its entirety. 

Backeround of the Invention 

Many secreted proteins, for example, cytokines and cytokine receptors, play a vital 

10 role in the regulation of cell growth, cell differentiation, and a variety of specific cellular 
responses. A number of medically useful proteins, including erythropoietin, granulocyte- 
macrophage colony stimulating factor, human growth hormone, and various interleukins, 
are secreted proteins. Thus, an important goal in the design and development of new 
therapies is the identification and characterization of secreted and transmembrane proteins 

1 5 and the genes which encode them. 

Many secreted proteins are receptors which bind a ligand and transduce an 
intracellular signal, leading to a variety of cellular responses. The identification and 
characterization of such a receptor enables one to identify both the ligands which bind to 
the receptor and the intracellular molecules and signal transduction pathways associated 

20 with the receptor, permitting one to identify or design modulators of receptor activity, e.g., 
receptor agonists or antagonists and modulators of signal transduction. 

Summary of the Invention 

The present invention is based, at least in part, on the discovery of cDNA 
25 molecules which encode the TANGO 253, 257 and 281 proteins and the INTERCEPT 258 
protein, all of which are either wholly secreted or transmembrane proteins. 

The TANGO 253 proteins are C 1 q domain-containing polypeptides that exhibit 
homology to a human adipocyte complement-related protein precursor. 

The TANGO 257 proteins are homologous to the human extracellular molecule 
30 olfactomedin, a molecule important in the maintenance, growth and differentiation of 
chemosensory cilia of olfactory neurons. 

The INTERCEPT 258 proteins are Ig domain-containing polypeptides that exhibit 
homology to an antigen (A33) expressed in colonic and small bowel epithelium, a protein 
that may represent a cancer cell marker. 
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The TANGO 281 proteins represent proteins downregulated in megakaryocytes 
that fail to express the gata-1 transcription factor (a factor critical for blood cell formation) 
and can, therefore, represent direct or indirect gata-1 targets. 

The TANGO 253, TANGO 257, INTERCEPT 258 and TANGO 281 proteins, 

5 fragments, derivatives, and variants thereof are collectively referred to herein as 

"polypeptides of the invention" or "proteins of the invention." Nucleic acid molecules 
encoding the polypeptides or proteins of the invention are collectively referred to as 
"nucleic acids of the invention/' 

The nucleic acids and polypeptides of the present invention are useful as 

1 0 modulating agents in regulating a variety of cellular processes. Accordingly, in one 

aspect, this invention provides isolated nucleic acid molecules encoding a polypeptide of 
the invention or a biologically active portion thereof. The present invention also provides 
nucleic acid molecules which are suitable for use as primers or hybridization probes for 
the detection of nucleic acids encoding a polypeptide of the invention. 

1 5 The invention features nucleic acid molecules which are at least 30%, 35%, 40%, 

45%, 50%, 55%, 65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence of 
SEQ ID NO:l, SEQ ID NO:2, or the nucleotide sequence of the cDNA insert of an 
EpT253 clone deposited with ATCC® as Accession Number 207222, or a complement 
thereof. 

20 The invention features nucleic acid molecules which are at least 30%, 35%, 40%, 

45%, 50%, 55%, 65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence of 
SEQ ID NO:8, SEQ ID NO:9, or the nucleotide sequence of the cDNA insert of an 
EpTm253 clone deposited with ATCC® as Accession Number 207215, or a complement 
thereof. 

25 The invention features nucleic acid molecules which are at least 95% or 98% 

identical to the nucleotide sequence of SEQ ID NO: 15, SEQ ID NO: 16, or the nucleotide 

sequence of the cDNA insert of an EpT257 clone deposited with ATCC® as Accession 

Number 207222, or a complement thereof. 

The invention features nucleic acid molecules which are at least 95% or 98% 
30 identical to the nucleotide sequence of SEQ ID NO:21, SEQ ID NO:22, or the nucleotide 

sequence of the cDNA insert of an EpTm257 clone deposited with ATCC® as Accession 

Number 2072 1 7, or a complement thereof. 

The invention features nucleic acid molecules which are at least 45%, 50%, 55%, 

65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence of SEQ ID NO:26, 
35 SEQ ID NO:27, or the nucleotide sequence of the cDNA insert of an EpT258 clone 

deposited with ATCC® as Accession Number 207222, or a complement thereof. 
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The invention features nucleic acid molecules which are at least 45%, 50%, 55%, 
65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence of SEQ ID NO:37, 
SEQ ID NO:38, or the nucleotide sequence of the cDNA insert of an EpTm258 clone 
deposited with ATCC® as Accession Number 207221, or a complement thereof. 
5 The invention features nucleic acid molecules which are at least 30%, 35%, 40%, 

45%, 50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence 
of SEQ ID NO:46, SEQ ID NO:47, or the nucleotide sequence of the cDNA insert of an 
EpT281 clone deposited with ATCC® as Accession Number 207222, or a complement 
thereof. 

1 0 The invention features nucleic acid molecules which are at least 35%, 40%, 45%, 

50%, 55%, 65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence of SEQ ID 
NO:56, SEQ ID NO:57, or the nucleotide sequence of the cDNA insert of an EpmT281 
clone deposited with ATCC® as patent deposit Number PTA-224, or a complement 
thereof. 

15 The invention features nucleic acid molecules which are at least 30%, 35%, 40%, 

45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or 98% identical to the 
nucleotide sequence of SEQ ID NO: 1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 
57, 77, 80, 91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 
129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 

20 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 
183, 184, 185, 186, 187, 188, 189, 190, 191or 192, a complement thereof, or the non- 
coding strand of EpT 253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281 or 
EpTm281 cDNA of ATCC® Accession 207222, Accession Number 207215, Accession 
207217, Accession Number 207221, or patent deposit Number PTA-224, wherein said 

25 nucleic acid molecules encode polypeptides or proteins that exhibit at least one structural 
and/or functional feature of a polypeptide of the invention. 

The invention features nucleic acid molecules of at least 450, 500, 550, 600, 650, 
700, 750, 800, 850, 900, 1000, 1 100, 1200 or 1300 contiguous nucleotides of the 
nucleotide sequence of SEQ ID NO:l, the nucleotide sequence of an EpT253 cDNA of 

30 ATCC® Accession Number 207222, or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700 or 720 contiguous 
nucleotides of the nucleotide sequence of SEQ ID NO:2, or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 

35 540, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1 100, 1200 or 1250 contiguous 
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nucleotides of the nucleotide sequence of SEQ ID NO:8 the nucleotide sequence of an 
EpTm253 cDNa of ATCC® Accession Number 207215, or a complement thereof. 

The invention features nucleic acid molecules of at least 310, 350, 400, 450, 500, 
550, 600, 650 or 700 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:9, 
5 or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
1 800 contiguous nucleotides of the nucleotide sequence of SEQ ID NO: 1 5 or its 
complement. 

The invention features nucleic acid molecules which include a fragment of at least 
10 1 150 or 1200 contiguous nucleotides of the nucleotide sequence of SEQ ID NO: 16, or its 
complement. 

The invention features nucleic acid molecules which include a fragment of at least 
1 100, 1200, 1300, 1400, 1500, 1600 or 1700 contiguous nucleotides of the nucleotide 
sequence of SEQ ID NO:21 the nucleotide sequence of an EpTm257 cDNA of ATCC® 
1 5 Accession Number 2072 1 7, or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
1 150 or 1200 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:22, or its 
complement. 

The invention features nucleic acid molecules which include a fragment of at least 
20 420, 450, 500, 600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 1600, 1700, or 
1800 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:26 the nucleotide 
sequence of an EpT258 cDNA of ATCC® Accession Number 207222, or a complement 
thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
25 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 
1600, 1700 or 1800 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:27, 
or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
675, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 contiguous 
30 nucleotides of the nucleotide sequence of SEQ ID NO:37 the nucleotide sequence of an 
EpTm258 cDNA of ATCC® Accession Number 207221, or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 
contiguous nucleotides of the nucleotide sequence of SEQ ID NO:38, or a complement 
35 thereof. 
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The invention features nucleic acid molecules which include a fragment of at least 
50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 
1600, 1700 or 1800 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:46 
the nucleotide sequence of an EpT28 1 cDNA of ATCC® Accession Number 207222, or a 
5 complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
50, 1 00, 200, 300, 400, 500, 600, 700 or 750 contiguous nucleotides of the nucleotide 
sequence of SEQ ID NO:47, or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
10 550, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800 or 1850 
contiguous nucleotides of the nucleotide sequence of SEQ ID NO:56 the nucleotide 
sequence of an EpTm281 cDNA of ATCC® patent deposit Number PTA-224, or a 
complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
1 5 50, 100, 200, 300, 400, 500, 600 or 700 contiguous nucleotides of the nucleotide sequence 
of SEQ ID NO:57, or a complement thereof. 

The invention features isolated nucleic acid molecules having a nucleotide 
sequence that is at least about 20, 50, 100, 150, 200, 250, 300, 400, 450, 500, 550, 600, 
650, 700 or more contiguous nucleotides identical to the nucleic acid sequence of SEQ ID 
20 NOS: 1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 
105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 
141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 
171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 
189, 190, 191 or 192, or a complement thereof, or the non-coding strand of EpT253, 
25 EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT28 1 or EpTm28 1 cDNA of 
ATCC® Accession 207222, Accession number 2072 1 5, Accession Number 2072 1 7, 
Accession Number 207221, or patent deposit number PTA-224, wherein said nucleic acid 
molecules encode polypeptides or proteins that exhibit at least one structural and/or 
functional feature of a polypeptide of the invention. 
30 The invention also features nucleic acid molecules which include a nucleotide 

sequence encoding a protein having an amino acid sequence that is at least 40%, 45%, 
50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of 
SEQ ID NO:3, the amino acid sequence encoded by an EpT253 cDNA of ATCC® 
Accession Number 207222, or a complement thereof. 
35 The invention also features nucleic acid molecules which include a nucleotide 

sequence encoding a protein having an amino acid sequence that is at least 95%, or 98% 
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identical to the amino acid sequence of SEQ ID NO: 10, the amino acid sequence encoded 
by an EpTm253 cDNA of ATCC® Accession Number 207 1 1 5 , or a complement thereof. 

The invention also features nucleic acid molecules which include a nucleotide 
sequence encoding a protein having an amino acid sequence that is at least 88%, 90%, 
5 95% or 98% identical to the amino acid sequence of SEQ ID NO: 1 7, the amino acid 
sequence encoded by an EpT257 cDNA of ATCC® Accession Number 207222, or a 
complement thereof. 

The invention also features nucleic acid molecules which include a nucleotide 
sequence encoding a protein having an amino acid sequence that is at least 88%, 90%, 
1 0 95%, or 98% identical to the amino acid sequence of SEQ ID NO:23, the amino acid 
sequence encoded by an EpTm257 cDNA of ATCC® Accession Number 2071 17, or a 
complement thereof. 

The invention also features nucleic acid molecules which include a nucleotide 
sequence encoding a protein having an amino acid sequence that is at least 45%, 50%, 
1 5 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID 
NO:28, the amino acid sequence encoded by an EpT258 cDNA of ATCC® Accession 
Number 207222, or a complement thereof. 

The invention also features nucleic acid molecules which include a nucleotide 
sequence encoding a protein having an amino acid sequence that is at least 45%, 50%, 
20 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID 
NO:39, the amino acid sequence encoded by an EpTm258 cDNA of ATCC® Accession 
Number 20722 1 , or a complement thereof. 

The invention also features nucleic acid molecules which include a nucleotide 
sequence encoding a protein having an amino acid sequence that is at least 30%, 35%, 
25 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid 
sequence of SEQ ID NO:48, the amino acid sequence encoded by an EpT281 cDNA of 
ATCC® Accession Number 207222, or a complement thereof. 

The invention also features nucleic acid molecules which include a nucleotide 
sequence encoding a protein having an amino acid sequence that is at least 30%, 35%, 
30 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid 
sequence of SEQ ID NO:58, the amino acid sequence encoded by an EpTm281 of 
ATCC® patent deposit Number PTA-224, or a complement thereof. 

The invention also features nucleic acid molecules which include a nucleotide 
sequence encoding a polypeptide or protein having an amino acid sequence that is at least 
35 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the 
amino acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, or 58, the amino acid 



-6- 



WO 00/78808 



PCT/US00/16883 



sequence encoded by EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, 
EpT281, or EpTm281 of ATCC® Accession Number 207222, Accession Number 207215, 
Accession Number 207217, or Accession Number 207221, patent deposit Number PTA- 
224, or a complement thereof, wherein the polypeptide or protein encoded by the 
5 nucleotide sequence also exhibits at least one structural and/or functional feature of a 
polypeptide of the invention. 

In preferred embodiments, the nucleic acid molecules have the nucleotide sequence 
of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56 or 57, or the nucleotide 
sequence of the cDNA clones of ATCC® Accession Number 207222, 207215, 207217, 
1 0 20722 1 , 207222, or PTA-224. 

Also within the invention are nucleic acid molecules which encode a fragment of a 
polypeptide having the amino acid sequence of SEQ ID NO:3, or a fragment including at 
least 10, 15, 20, 25, 30, 50, 75, 100, 125, 150, 175, 200, 225, 230 or 240 contiguous amino 
acids of SEQ ID NO:3, or the amino acid sequence encoded by an EpT253 cDNA of 
1 5 ATCC® Accession Number 207222. 

Also within the invention are nucleic acid molecules which encode a fragment of a 
polypeptide having the amino acid sequence of SEQ ID NO: 17, or a fragment including at 
least 10, 15, 20, 25, 30, 50, 75, 100, 125, 150, 175, 200, 225, 230 or 240 contiguous amino 
acids of SEQ ID NO: 10, or the amino acid sequence encoded by an EpTm253 cDNA of 
20 ATCC® Accession Number 2072 1 5. 

Also within the invention are nucleic acid molecules which encode a fragment of a 
polypeptide having the amino acid sequence of SEQ ID NO: 10, or a fragment including at 
least 360, 370, 380, 390 or 400 contiguous amino acids of SEQ ID NO: 17, or the amino 
acid sequence encoded by an EpT257 cDNA of ATCC® Accession Number 207222. 
25 Also within the invention are nucleic acid molecules which encode a fragment of a 

polypeptide having the amino acid sequence of SEQ ID NO:23, or a fragment including at 
least 360, 370, 380, 390 or 400 contiguous amino acids of SEQ ID NO:23, or the amino 
acid sequence encoded by an EpTm257 cDNA of ATCC® Accession Number 2072 1 7. 

Also within the invention are nucleic acid molecules which encode a fragment of a 
30 polypeptide having the amino acid sequence of SEQ ED NO:3, or a fragment including at 
least 15, 25, 30, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300, 350 or 360 
contiguous amino acids of SEQ ID NO:28, or the amino acid sequence encoded by an 
EpT258 cDNA of ATCC® Accession Number 207222. 

Also within the invention are nucleic acid molecules which encode a fragment of a 
35 polypeptide having the amino acid sequence of SEQ ID NO:39, or a fragment including at 
least 160, 175, 200, 225, 250, 275, 300, 350, 375 or 385 contiguous amino acids of SEQ 
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ID NO:39, or the amino acid sequence encoded by an EpT258 cDNA of ATCC® 
Accession Number 20722 1 . 

Also within the invention are nucleic acid molecules which encode a fragment of a 
polypeptide having the amino acid sequence of SEQ ID NO:48, or a fragment including at 
5 least 15, 25, 30, 50, 75, 100, 125, 150, 175, 200, 225, 235 or 240 contiguous amino acids 
of SEQ ID NO:48, or the amino acid sequence encoded by an EpT281 cDNA of ATCC® 
Accession Number 207222. 

Also within the invention are nucleic acid molecules which encode a fragment of a 
polypeptide having the amino acid sequence of SEQ ID NO:58, or a fragment including at 
10 least 15, 25, 30, 50, 75, 100, 125, 150, 175 or 200 contiguous amino acids of SEQ ID 
NO:58, or the amino acid sequence encoded by an EpTm281 cDNA of ATCC® patent 
deposit Number PTA-224. 

The invention also features nucleic acid molecules which encode a polypeptide 
fragment of at least 15, 25, 30, 50, 75, 100, 125, 150, 175, 200 or more contiguous amino 
1 5 acids of SEQ ID NO:3, 10, 17, 23, 28, 39, 48 or 58, or the amino acid sequence encoded 
by EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281 or EpTm281 of 
ATCC® Accession Number 207222, Accession Number 207215, Accession Number 
. 207217, Accession Number 207221 or patent deposit Number PTA-224, wherein the 
fragment also exhibits at least one structural and/or functional feature of a polypeptide of 
20 the invention. 

The invention includes nucleic acid molecules which encode a naturally occurring 
allelic variant of a polypeptide comprising the amino acid sequence of SEQ ID NO:3, 10, 
17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 

25 or 164, or the amino acid sequence encoded by a cDNA of ATCC® Accession Number 
207222, Accession Number 207215, Accession Number 207217, Accession Number 
207221 or patent deposit Number PTA-224, wherein the nucleic acid molecule hybridizes 
to a nucleic acid molecule consisting of a nucleic acid sequence encoding SEQ ID NO:3, 
10,28, 39, 48, 58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 

30 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 
164, or the amino acid sequence encoded by a cDNA of ATCC® Accession Number 
207222, Accession Number 207215, Accession Number 207217, Accession Number 
207221 or patent deposit Number PTA-224, or a complement thereof under stringent 
conditions. 

35 Also within the invention are isolated polypeptides or proteins having an amino 

acid sequence that is at least about 40%, preferably 45%, 55%, 65%, 75%, 85%, 95% or 
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98% identical to the amino acid sequence of SEQ ID NO:3, or the amino acid sequence 
encoded by an EpT253 cDNA of ATCC® Accession Number 207222. 

Also within the invention are isolated polypeptides or proteins having an amino 
acid sequence that is at least about 40%, preferably 45%, 50%, 55%, 65%, 75%, 85%, 

5 95% or 98% identical to the amino acid sequence of SEQ ID NO: 1 0, or the amino acid 
sequence encoded by an EpTm253 cDNA of ATCC® Accession Number 2072 1 5. 

Also within the invention are isolated polypeptides or proteins having an amino 
acid sequence that is at least 88%, 90%, 95% or 98% identical to the amino acid sequence 
of SEQ ID NO: 17, or the amino acid sequence encoded by an EpT257 cDNA of ATCC® 

10 Accession Number 207222. 

Also within the invention are isolated polypeptides or proteins having an amino 
acid sequence that is at least 88%; 90%, 95% or 98% identical to the amino acid sequence 
of SEQ ED NO:23, or the amino acid sequence encoded by an EpTm257 cDNA of 
ATCC® Accession Number 207217. 

1 5 Also within the invention are isolated polypeptides or proteins having an amino 

acid sequence that is at least about 30%, preferably 35%, 45%, 55%, 65%, 75%, 85%, 
95% or 98% identical to the amino acid sequence of SEQ ID NO:28, or the amino acid 
sequence encoded by an EpT258 cDNA of ATCC® Accession Number 207222. 

Also within the invention are isolated polypeptides or proteins having an amino 

20 acid sequence that is at least about 30%, preferably 35%, 40%, 45%, 50%, 55%, 65%, 
75%, 85%, 95% or 98% identical to the amino acid sequence of SEQ ID NO:39, or the 
amino acid sequence encoded by an EpTm258 cDNA of ATCC® Accession Number 
207221. 

Also within the invention are isolated polypeptides or proteins having an amino 
25 acid sequence that is at least about 30%, preferably 35%, 45%, 55%, 65%, 75%, 85%, 
95% or 98% identical to the amino acid sequence of SEQ ID NO:48, or the amino acid 
sequence encoded by an EpT281 cDNA of ATCC® Accession Number 207222. 

Also within the invention are isolated polypeptides or proteins having an amino 
acid sequence that is at least about 30%, preferably 35%, 40%, 45%, 50%, 55%, 65%, 
30 75%, 85%, 95% or 98% identical to the amino acid sequence of SEQ ID NO:58, or the 
amino acid sequence encoded by an EpTm281 cDNA of ATCC® patent deposit Number 
PTA-224. 

The invention also features isolated polypeptides or proteins having an amino acid 
sequence that is at least about 30%, preferably 35%, 40%, 45%, 50%, 55%, 65%, 75%, 
35 85%, 95% or 98% identical to the amino acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 
39, 48 or 58, or the amino acid sequence encoded by EpT253, EpTm253, EpT257, 
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EpTm257, EpT258, EpTm258, EpT281 orEpTm281 of ATCC® Accession Number 
207222, Accession Number 207215, Accession Number 207217, Accession Number 
20722 1 , patent deposit Number PTA-224, wherein the protein or polypeptides also 
exhibit at least one structural and/or functional feature of a polypeptide of the invention. 

5 Also within the invention are isolated polypeptides or proteins which are encoded 

by a nucleic acid molecule having a nucleotide sequence that is at least about 30%, 
preferably 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% identical to 
the nucleic acid sequence encoding SEQ ID NO:3, and isolated polypeptides or proteins 
which are encoded by a nucleic acid molecule having a nucleotide sequence which 

10 hybridizes under stringent hybridization conditions to a nucleic acid molecule having the 
nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO:2, a complement thereof, or the non- 
coding strand of an EpT253 cDNA of ATCC® Accession Number 207222. 

Also within the invention are isolated polypeptides or proteins which are encoded 
by a nucleic acid molecule having a nucleotide sequence that is at least about 30%, 

1 5 preferably 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% identical to 
the nucleic acid sequence encoding SEQ ID NO: 10, and isolated polypeptides or proteins 
which are encoded by a nucleic acid molecule having a nucleotide sequence which 
hybridizes under stringent hybridization conditions to a nucleic acid molecule having the 
nucleotide sequence of SEQ ID NO:8 or SEQ ID NO:9, a complement thereof, or the non- 
20 coding strand of an EpTm253 cDNA of ATCC® Accession Number 2072 15. 

Also within the invention are isolated polypeptides or proteins which are encoded 
by a nucleic acid molecule having a nucleotide sequence that is at least about 45%, 50%, 
55%, 60%, 65%, 75%, 85%, 95% or 98% identical to the nucleic acid sequence encoding 
SEQ ID NO:28, and isolated polypeptides or proteins which are encoded by a nucleic acid 

25 molecule having a nucleotide sequence which hybridizes under stringent hybridization 
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:26 or 
SEQ ID NO:27, a complement thereof, or the non-coding strand of an EpT258 cDNA of 
ATCC® Accession Number 207222, 

Also within the invention are isolated polypeptides or proteins which are encoded 

30 by a nucleic acid molecule having a nucleotide sequence that is at least about 45%, 50%, 
55%, 60%, 65%, 75%, 85%, 95% or 98% identical to the nucleic acid sequence encoding 
SEQ ID NO:39, and isolated polypeptides or proteins which are encoded by a nucleic acid 
molecule having a nucleotide sequence which hybridizes under stringent hybridization 
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:37 or 

35 SEQ ID NO:38, a complement thereof, or the non-coding strand of an EpTm258 cDNA of 
ATCC® Accession Number 20722 1 . 
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Also within the invention are isolated polypeptides or proteins which are encoded 
by a nucleic acid molecule having a nucleotide sequence that is at least about 30%, 
preferably 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% identical to 
the nucleic acid sequence encoding SEQ ED NO:48, and isolated polypeptides or proteins 

5 which are encoded by a nucleic acid molecule having a nucleotide sequence which 

hybridizes under stringent hybridization conditions to a nucleic acid molecule having the 
nucleotide sequence of SEQ ID NO:46 or SEQ ID NO:47, a complement thereof, or the 
non-coding strand of an EpT28 1 cDNA of ATCC® Accession Number 207222 . 

Also within the invention are isolated polypeptides or proteins which are encoded 

10 by a nucleic acid molecule having a nucleotide sequence that is at least about 30%, 

preferably 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% identical to 
the nucleic acid sequence encoding SEQ ID NO: 5 8, and isolated polypeptides or proteins 
which are encoded by a nucleic acid molecule having a nucleotide sequence which 
hybridizes under stringent hybridization conditions to a nucleic acid molecule having the 

1 5 nucleotide sequence of SEQ ED NO:56 or SEQ ED NO:57, a complement thereof, or the 
non-coding strand of an EpTm281 cDNA of ATCC® patent deposit Number PTA-224. 

The invention also features isolated polypeptides or proteins which are encoded by 
a nucleic acid molecule having a nucleotide sequence that is at least about 30%, preferably 
35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% identical to a nucleic 

20 acid sequence encoding SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 110, 
112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 
148, 150, 152, 154, 156, 158, 160, 162 or 164, isolated polypeptides or proteins which are 
encoded by a nucleic acid molecule having a nucleotide sequence which hybridizes under 
stringent hybridization conditions to a nucleic acid molecule having the nucleotide 

25 sequence of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 101, 
103, 104, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 
137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 
169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 
187, 188, 189, 190, 191 or 192, a complement thereof, or the non-coding strand of 

30 EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281, EpTm281 of 
ATCC® Accession Number 207222, Accession Number 20721 5, Accession Number 
207217, Accession Number 207221, patent deposit Number PTA-224, wherein 
polypeptides or proteins also exhibit at least one structural and/or functional feature of a 
polypeptide of the invention. 
3 5 Also within the invention are polypeptides which are naturally occurring allelic 

variants of a polypeptide that includes the amino acid sequence of SEQ ED NO:3, 10, 17, 
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23, 28, 39, 48 or 58, or the amino acid sequence encoded by a cDNA of ATCC® 
Accession Number 207222, Accession Number 207215, Accession Number 207217 
Accession Number 207221, or patent deposit Number PTA-224, wherein the polypeptide 
is encoded by a nucleic acid molecule which hybridizes to a nucleic acid molecule having 

5 the sequence of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56 or 57, or a 
complement thereof under stringent conditions. 

The invention also features nucleic acid molecules that hybridize under stringent 
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:l or 
2, or an EpT253 cDNA of ATCC® Accession Number 207222, or a complement thereof. 

10 In other embodiments, the nucleic acid molecules are at least 450, 500, 550, 600, 650, 
700, 750, 800, 1000, 1100, 1200 or 1300 contiguous nucleotides in length and hybridize 
under stringent conditions to a nucleic acid molecule comprising the nucleotide sequence 
of SEQ ID NO:l or 2, an EpT253 cDNA of ATCC® Accession Number 207222, or a 
complement thereof. 

1 5 The invention also features nucleic acid molecules that hybridize under stringent 

conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:8 or 
SEQ ID NO:9, an EpTm253 cDNA of ATCC® Accession Number 207215, or a 
complement thereof. In other embodiments, the nucleic acid molecules are at least 540, 
550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1050, 1 100, 1 159, 1200, or 1250 

20 contiguous nucleotides in length and hybridize under stringent conditions to a nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID NO:8 or SEQ ID NO:9, an 
EpTm253 cDNA of ATCC® Accession Number 207215, or a complement thereof. 

The invention also features nucleic acid molecules that hybridize under stringent 
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO: 15 or 

25 SEQ ID NO: 1 6, an EpT257 cDNA of ATCC® Accession Number 207222, or a 

complement thereof and encode a polypeptide comprising the amino acid sequence of 
SEQ ID NO: 17, or encode a polypeptide comprising at least 360, 370, 380, 390 or 400 
contiguous amino acids or SEQ ID NO: 17. 

The invention also features nucleic acid molecules that hybridize under stringent 

30 conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:21 or 
SEQ ID NO:22, an EpTm257 cDNA of ATCC® Accession Number 207217, or a 
complement thereof, and encode a polypeptide comprising the amino acid sequence of 
SEQ ID NO:23, or a polypeptide comprising at least 360, 370, 380, 390, or 400 
contiguous amino acids of SEQ ID NO:23. 

35 The invention also features nucleic acid molecules that hybridize under stringent 

conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:26 or 
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SEQ ED NO:27, an EpT258 cDNA of ATCC® Accession Number 207222, or a 
complement thereof. In other embodiments, the nucleic acid molecules are at least 550, 
600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 contiguous 
nucleotides in length and hybridize under stringent conditions to a nucleic acid molecule 

5 comprising the nucleotide sequence of SEQ ID NO:26 or SEQ ID NO:27, an EpT258 
cDNA of ATCC® Accession Number 207222, or a complement thereof. 

The invention also features nucleic acid molecules that hybridize under stringent 
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:37 or 
SEQ ID NO:38, an EpTm258 cDNA of ATCC® Accession Number 207221, or a 

10 complement thereof. In other embodiments, the nucleic acid molecules are at least 650, 
700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 contiguous 
nucleotides in length and hybridize under stringent conditions to a nucleic acid molecule 
comprising the nucleotide sequence of SEQ ID NO:37 or SEQ ID NO:38, an EpTm258 
cDNA of ATCC® Accession Number 207221, or a complement thereof. 

1 5 The invention also features nucleic acid molecules that hybridize under stringent 

conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:46 or 
47, an EpTm281 cDNA of ATCC® Accession Number 207222, or a complement thereof. 
In other embodiments, the nucleic acid molecules are at least 710, 750, 800, 900, 1000, 
1 100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 contiguous nucleotides in length and 

20 hybridize under stringent conditions to a nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NO:46 or SEQ ID NO:47, an EpT281 cDNA of ATCC® Accession 
Number 207222, or a complement thereof. 

The invention also features nucleic acid molecules that hybridize under stringent 
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:56 or 

25 57, an EpTm28 1 cDNA of ATCC® patent deposit Number PTA-224, or a complement 
thereof. In other embodiments, the nucleic acid molecules are at least 580, 600, 700, 800, 
900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800 or 1850 contiguous 
nucleotides in length and hybridize under stringent conditions to a nucleic acid molecule 
comprising the nucleotide sequence of SEQ ID NO:56 or SEQ ID NO:57, an EpTm281 

30 cDNA of ATCC® patent deposit Number PTA-224, or a complement thereof. 

The invention also features nucleic acid molecules that hybridize under stringent 
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:l, 2, 
8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 101, 103, 105, 107, 109, 1 1 1, 1 13, 
115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 

35 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 
176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, or a 
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nucleotide sequence of EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, 
EpT281 or EpTm281 of ATCC® Accession Number 207222, Accession Number 207215, 
Accession Number 207217, Accession Number 207221, patent deposit Number PTA-224, 
or complement thereof, wherein such nucleic acid molecules encode polypeptides or 
5 proteins that exhibit at least one structural and/or functional feature of a polypeptide of the 
invention. 

The invention also features nucleic acid molecules at least 15, preferably at least 
50, at least 75, at least 100, at least 150, at least 200, at least 250, at least 300, at least 350, 
at least 400, at least 500, at least 600, at least 700, at least 800, at least 1000, at least 1 100 

10 or at least 1200 or more contiguous nucleotides in length which hybridize under stringent 
conditions to a nucleic acid molecule comprising the nucleotide sequence of SEQ ID 
NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 104, 
105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 
141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 

15 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 
189, 190, 191 or 192, or a nucleotide sequence of EpT253, EpTm253, EpT257, EpTm257, 
EpT258, EpTm258, EpT281 or EpTm281 of ATCC® Accession Number 207222, 
Accession Number 207215, Accession Number 207217, Accession Number 207221, 
patent deposit Number PTA-224, or a complement thereof, wherein said nucleic acid 

20 molecules encode polypeptides or proteins that exhibit at least one structural and/or 
functional feature of a polypeptide of the invention. 

In one embodiment, the invention provides an isolated nucleic acid molecule 
which is antisense to the coding strand of a nucleic acid of the invention. 

Another aspect of the invention provides vectors, e.g., recombinant expression 

25 vectors, comprising a nucleic acid molecule of the invention. In another embodiment, the 
invention provides host cells containing such a vector or engineered to contain and/or 
express a nucleic acid molecule of the invention. The invention also provides methods for 
producing a polypeptide of the invention by culturing, in a suitable medium, a host cell of 
the invention such that a polypeptide of the invention is produced. 

30 Another aspect of this invention features isolated or recombinant proteins and 

polypeptides of the invention. Preferred proteins and polypeptides possess at least one 
biological activity possessed by the corresponding naturally-occurring human polypeptide. 
An activity, a biological activity, or a functional activity of a polypeptide or nucleic acid 
of the invention refers to an activity exerted by a protein, polypeptide or nucleic acid 

35 molecule of the invention on a responsive cell as determined in vivo or in vitro, according 
to standard techniques. Such activities can be a direct activity, such as an association with 
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or an enzymatic activity on a second protein, or an indirect activity, such as a cellular 
signaling activity mediated by interaction of the protein with a second protein. 

For TANGO 253, biological activities include, e.g., (1) the ability to modulate (this 
term, as used herein, includes, but is not limited to, "stabilize", promote, inhibit or disrupt, 
5 protein-protein interactions (e.g., homophilic and/or heterophilic), and protein-ligand 
interactions, e.g., in receptor-ligand recognition; (2) the ability to modulate the 
development, differentiation, maturation, proliferation and/or activity of cells of the 
central nervous system such as neurons, glial cells (e.g, astrocytes and oligodendrocytes), 
and Schwann cells; (3) the ability to modulate the development of central nervous system; 
1 0 (4) the ability to modulate the development, differentiation, maturation, proliferation 
and/or activity of renal cells; (5) the ability to modulate the development, differentiation, 
maturation, proliferation and/or activity of testical cells, such as germ cells, leydig cells 
and Sertoli cells; (6) the ability to modulate the development, differentiation, maturation, 
proliferation and/or activity of ovarian cells; (7) ability to modulate cell-cell interactions 

1 5 and/or cell-extracellular matrix interactions; (8) the ability to modulate the host immune 
response, e.g., by modulating one or more elements in the serum complement cascade; (9) 
the ability to modulate the proliferation, differentiation and/or activity of cells that form 
blood vessels and coronary tissue (e.g., coronary smooth muscle cells and/or blood vessel 
endothelial cells); (10) the ability to modulate intracellular signaling cascades (e.g., signal 

20 transduction cascades); and (1 1) the ability to modulate adipocyte function. 

For TANGO 257, biological activities include, e.g., (1) the ability to modulate the 
development, differentiation, proliferation and/or activity of neuronal cells, e.g., olfactory 
neurons (2) the ability to modulate the development, differentiation, proliferation and/or 
activity of pulmonary system cells, e.g., lung cell types; (4) the ability to modulate the 

25 development, differentiation, maturation, proliferation and/or activity of bone cells such as 
osteocytes, osteoblasts and osteoclasts (e.g., the ability promote the development of 
osteocytes); (5) the ability to modulate the development of bone structures such as the 
skull, the basisphenoid bone, the upper and lower incisor teeth, the vertebral column, the 
sternum, the scapula, and the femur during embryogenesis; (6) the ability to modulate the 

30 development, differentiation, maturation, proliferation and/or activity of renal cells; (7) the 
ability to modulate the development, differentiation, maturation, proliferation and/or 
activity of intestinal cells such as M cells; (8) the ability to modulate cell-cell interactions 
and/or cell-extracellular matrix interactions, e.g., neuronal cell-extracellular matrix 
interactions; (9) the ability to modulate cell proliferation, e.g, abnormal cell proliferation; 

35 and ( 1 0) the ability to modulate the development, differentiation, proliferation and/or 
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activity of cells that form blood vessels and coronary tissue, e.g., coronary smooth muscle 
cells and/or blood vessel endothelial cells. 

For INTERCEPT 258, biological activities include, e.g., (1) the ability to modulate 
protein-protein interactions {e.g., homophilic and/or heterophilic), and protein-ligand 

5 interactions, e.g., in receptor-ligand recognition; (2) the ability to modulate cell-cell 
interactions; (3) the ability to modulate the host immune response; (4) the ability to 
modulate the development, differentiation, maturation, proliferation and/or activity of 
pulmonary system cells such as bronchial cells; (5) the ability to modulate the 
development, differentiation, maturation, proliferation and/or activity of renal cells; (5) the 

10 ability to modulate the development, differentiation, maturation, proliferation and/or 
activity of cardiac cells such cardiac myocytes; (6) the ability to modulate the 
development of brown fat (e.g., the promotion of the development of brown fat); (7) the 
ability to modulate the development, differentiation, maturation, proliferation and/or 
activity of endothelial cells; (8) the ability to modulate cell proliferation, e.g., 

1 5 gastrointestinal tract epithelial cell proliferation; (9) the ability to modulate intracellular 
signaling cascades (e.g., signal transduction cascades); and (10) the ability to modulate 
thrombosis (e.g., the ability to facilitate the removal of blood clots) and/or vascularization 
(e.g. , the promotion of vascularization). 

For TANGO 281, biological activities include, e.g., (1) the ability to modulate, 

20 e.g., stabilize, promote, inhibit or disrupt protein-protein interactions (e.g., homophilic 
and/or heterophilic), and protein-ligand interactions, e.g., in receptor-ligand recognition; 
(2) the ability to modulate cell-cell interactions; (3) the ability to modulate the host 
immune response; (4) the ability to modulate the proliferation, differentiation and/or 
activity of hematopoeitic cells (e.g. megakaryocytes); (5) the ability to modulate the 

25 development, differentiation, maturation, proliferation and/or activity of pulmonary 
system cells; (6) the ability to modulate the development, differentiation, maturation, 
proliferation and/or activity intestinal cells such as M cells; (7) the ability to modulate the 
development, differentiation, maturation, proliferation and/or activity of stomach cells 
such as cells of the gastric epithelium; (8) the ability to modulate intracellular signaling 

30 cascades (e.g., signal transduction cascades); and (9) the ability to modulate platelet 
function (e.g., the promotion of platelet aggregation). 

In one embodiment, a polypeptide of the invention has an amino acid sequence 
sufficiently identical to an identified domain of a polypeptide of the invention. As used 
herein, the term "sufficiently identical" refers to a first amino acid or nucleotide sequence 

35 which contains a sufficient or minimum number of identical or equivalent (e.g., with a 

similar side chain) amino acid residues or nucleotides to a second amino acid or nucleotide 
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sequence such that the first and second amino acid or nucleotide sequences have or encode 
a common structural domain and/or common functional activity. For example, amino acid 
or nucleotide sequences which contain or encode a common structural domain having 
about 60% identity, preferably 65% identity, more preferably 75%, 85%, 95%, 98% or 
5 more identity are defined herein as sufficiently identical. 

In one embodiment, a TANGO 253 protein includes at least one or more of the 
following domains: a signal sequence, a collagen domain and a Clq domain. 

In one embodiment, a TANGO 257 protein includes at least a signal peptide. 

In one embodiment, an INTERCEPT 258 includes at least one or more of the 
10 following domains: a signal sequence, an extracellular domain, an immunoglobulin (Ig) 
domain, a transmembrane domain, and an intracellular or cytoplasmic domain. 

In one embodiment, a TANGO 281 protein includes at least one or more of the 
following domains: a signal sequence, an extracellular domain, a photosystem II 10 kD 
phosphoprotein domain, a transmembrane domain, and an intracellular or cytoplasmic 
15 domain. 

The polypeptides of the present invention, or biologically active portions thereof, 
can be operably linked to a heterologous amino acid sequence to form fusion proteins. 
The invention further features antibodies, such as monoclonal or polyclonal antibodies, 
that specifically bind a polypeptide of the invention. In addition, the polypeptides of the 

20 invention or biologically active portions thereof can be incorporated into pharmaceutical 
compositions, which optionally include pharmaceutically acceptable carriers. 

In another aspect, the present invention provides methods for detecting the 
presence, activity or expression of a polypeptide of the invention in a biological sample by 
contacting the biological sample with an agent capable of detecting an indicator of the 

25 presence, activity or expression such that the presence activity or expression of a 
polypeptide of the invention is detected in the biological sample. 

In another aspect, the invention provides methods for modulating activity of a 
polypeptide of the invention comprising contacting a cell with an agent that modulates 
(inhibits or stimulates) the activity or expression of a polypeptide of the invention such 

30 that activity or expression in the cell is modulated. In one embodiment, the agent is an 
antibody that specifically binds to a polypeptide of the invention. 

In another embodiment, the agent modulates expression of a polypeptide of the 
invention by modulating transcription, splicing, or translation of an mRNA encoding a 
polypeptide of the invention. In yet another embodiment, the agent is a nucleic acid 

35 molecule having a nucleotide sequence that is antisense to the coding strand of an mRNA 
encoding a polypeptide of the invention. 
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The present invention also provides methods to treat a subject having a disorder 
characterized by aberrant activity of a polypeptide of the invention or aberrant expression 
of a nucleic acid of the invention by administering an agent which is a modulator of the 
activity of a polypeptide of the invention or a modulator of the expression of a nucleic acid 

5 of the invention to the subject. In one embodiment, the modulator is a protein of the 
invention. In another embodiment, the modulator is a nucleic acid of the invention. In 
other embodiments, the modulator is a peptide, peptidomimetic, or other small molecule. 

The present invention also provides diagnostic assays for identifying the presence 
or absence of a genetic lesion or mutation characterized by at least one of: (i) aberrant 

10 modification or mutation of a gene encoding a polypeptide of the invention; (ii) mis- 
regulation of a gene encoding a polypeptide of the invention; and (iii) aberrant post- 
translational modification of the invention wherein a wild-type form of the gene encodes a 
protein having the activity of the polypeptide of the invention. 

In another aspect, the invention provides a method for identifying a compound that 

1 5 binds to or modulates the activity of a polypeptide of the invention. In general, such 
methods entail measuring a biological activity of the polypeptide in the presence and 
absence of a test compound and identifying those compounds which alter the activity of 
the polypeptide. 

The invention also features methods for identifying a compound which modulates 
20 the expression of a polypeptide or nucleic acid of the invention by measuring the 
expression of the polypeptide or nucleic acid in the presence and absence of the 
compound. 

In another aspect, the invention provides substantially purified antibodies or 
fragments thereof, including human, humanized, chimeric and non-human antibodies or 

25 fragments thereof, which antibodies or fragments specifically bind to a polypeptide 

comprising an amino acid sequence of SEQ ID NO: 3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 
106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 
142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164, or the amino acid sequence 
encoded by the EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281 or 

30 EpTm281 cDNA insert of the plasmid deposited with the ATCC® as Accession Number 
207222, Accession Number 207215, Accession number 207217, Accession number 
20722 1 , or patent deposit Number PTA-224. 

In another aspect, the invention provides substantially purified antibodies or 
fragments thereof, including, e.g., human, non-human, chimeric and humanized 

35 antibodies, which antibodies or fragments thereof specifically bind to a polypeptide 

comprising at least 15 contiguous amino acids of the amino acid sequence of SEQ ID NO: 
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3, 10, 17, 23,28, 39,48, 58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 
126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 
162 or 164, or the amino acid sequence encoded by the EpT253, EpTm253, EpT257, 
EpTm257, EpT258, EpTm258, EpT281 or EpTm281 cDNA insert of the plasmid 
5 deposited with the ATCC® as Accession Number 207222, Accession number 2072 1 5, 
Accession number 207217, Accession number 207221, or patent deposit number PTA- 
224, or a complement thereof. 

In another aspect, the invention provides substantially purified antibodies or 
fragments thereof, including, e.g., human, non-human, chimeric and humanized 
1 0 antibodies, which antibodies or fragments thereof specifically bind to a polypeptide 

comprising at least 95% identical to the amino acid sequence of SEQ ID NO: 3, 10, 17, 23, 
28, 39,48,58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 
132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164, or 
the amino acid sequence encoded by the EpT253, EpTm253, EpT257, EpTm257, EpT258, 
1 5 EpTm258, EpT281 or EpTm281 cDNA insert of the plasmid deposited with the ATCC® 
as Accession Number 207222, Accession number 207215, Accession number 207217, 
Accession number 207221, or patent deposit number PTA-224, or a complement thereof. 

In another aspect, the invention provides substantially purified antibodies or 
fragments thereof, including, e.g., human, non-human, chimeric and humanized 
20 antibodies, which antibodies or fragments thereof specifically bind to a polypeptide 

encoded by a nucleic acid molecule which hybridizes to the nucleic acid molecule of SEQ 
ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47,*56, 57, 77, 80, 91, 100, 101, 103, 
104, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 
139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 
25 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 
188, 189, 190, 191 or 192 under conditions of hybridization of 6 X SSC at 45°C and 
washing in 0.2 X SSC, 0. 1% SDS at 65 °C. 

Any of the antibodies of the invention can be conjugated to a therapeutic moiety or 
to a detectable substance. Non-limiting examples of detectable substances that can be 
30 conjugated to the antibodies of the invention are an enzyme, a prosthetic group, a 

fluorescent material, a luminescent material, a bioluminescent material, and a radioactive 
material. 

The invention also provides a kit containing an antibody of the invention 
conjugated to a detectable substance, and instructions for use. Still another aspect of the 
35 invention is a pharmaceutical composition comprising an antibody of the invention and a 
pharmaceutical^ acceptable carrier. In preferred embodiments, the pharmaceutical 
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composition contains an antibody of the invention, a therapeutic moiety, and a 
pharmaceutical^ acceptable carrier. 

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

5 

Brief Description of the Drawings 

FIGURES 1 A-AB depict the cDNA sequence of human TANGO 253 (SEQ ID 
NO:l) and the predicted amino acid sequence of human TANGO 253 (SEQ ID NO:3). 
The open reading frame of SEQ ID NO: 1 extends from nucleotide 1 88 to nucleotide 9 1 6 

10 of SEQ ID NO: 1 (SEQ ID NO:2). 

FIGURE 2 depicts a hydropathy plot of human TANGO 253. Relatively 
hydrophobic regions of the protein are above the dashed horizontal line, and relatively 
hydrophilic regions of the protein are below the dashed horizontal line. The cysteine 
residues (cys) are indicated by short vertical lines just below the hydropathy trace. The 

1 5 dashed vertical line separates the signal sequence (amino acids 1 to 15 of SEQ ID NO:3; 
SEQ ID NO: 5) on the left from the mature protein (amino acids 16 to 243 of SEQ ID 
NO:3; SEQ ID NO:4) on the right. Below the hydropathy plot, the amino acid sequence 
of human TANGO 253 is depicted. 

FIGURES 3A-3B depict a cDNA sequence of mouse TANGO 253 (SEQ ID NO:8) 

20 and the predicted amino acid sequences of mouse TANGO 253 (SEQ ID NO: 1 0). The 
open reading frame of SEQ ID NO: 10 extends from nucleotide 135 to 863 of SEQ ID 
NO:10(SEQIDNO:9). 

FIGURE 4 depicts a hydropathy plot of mouse TANGO 253. Relatively 
hydrophobic regions of the protein are shown above the dashed horizontal line, and 

25 relatively hydrophilic regions of the protein are below the dashed horizontal line. The 

cysteine residues (cys) are indicated by short vertical lines just below the hydropathy trace. 
The dashed vertical line separates the signal sequence (amino acids 1 to 15 of SEQ ID 
NO: 10; SEQ ID NO: 12) on the left from the mature protein (amino acids 16 to 243 of 
SEQ ID NO:10; SEQ ID NO:l 1) on the right. Below the hydropathy plot, the amino acid 

30 sequence of mouse TANGO 253 is depicted. 

FIGURE 5 depicts an alignment of the amino acid sequence of human TANGO 
253 (SEQ ID NO:3) and the amino acid sequence of mouse TANGO 253 (SEQ ID 
NO: 10). The alignment demonstrates that the amino acid sequences of human and mouse 
TANGO 253 are 93.8% identical. This alignment was performed using the ALIGN 

35 program with a PAM120 scoring matrix, a gap length penalty of 12 and a gap penalty of 4. 
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FIGURES 6A-6B depict alignments of the amino acid sequence of human 
adipocyte complement-mediated protein precursor (SEQ ID NO:20; Swiss Prot Accession 
Number Q15848) and the amino acid sequence of human TANGO 253 (SEQ ID NO:3; 
6A) or mouse TANGO 253 (SEQ ID NO: 10; 6B). 6A shows the amino acid sequences of 

5 human adipocyte complement-mediated protein precursor and human TANGO 253 are 
38.7% identical. 6B shows the amino acid sequences of human adipocyte complement- 
mediated precursor procursor protein and mouse TANGO 253 are 38.3% identical. These 
alignments were performed using the ALIGN alignment program with a PAM120 scoring 
matrix, a gap length penalty of 1 2, and a gap penalty of 4. 
1 0 FIGURES 7A-7C depict alignments of the nucleotide sequence of human 

adipocyte complement-mediated protein precursor (SEQ ID NO:32; GenBank Accession 
Number A141 7523) and the nucleotide sequence of human TANGO 253 (SEQ ID NO:l) . 
The nucleotide sequences of human adipocyte complement-mediated protein precursor 
and human TANGO 253 are 29.1% identical. These alignments were performed using the 

1 5 ALIGN alignment program with a PAM 1 20 scoring matrix, a gap length penalty of 1 2, 
and a gap penalty of 4. 

FIGURES 8A-8C depict alignments of the nucleotide sequence of human 
adipocyte complement-mediated protein precursor (SEQ ID NO:32; GenBank Accession 
Number A1417523) and the nucleotide sequence of mouse TANGO 253 (SEQ ID NO:8). 

20 The nucleotide sequences of human adipocyte complement-mediated protein precursor 
and mouse TANGO 253 are 30.4% identical. These alignments were performed using the 
ALIGN alignment program with a PAM 120 scoring matrix, a gap length penalty of 12, 
and a gap penalty of 4. 

FIGURES 9A-9B depict the cDNA sequence of human TANGO 257 (SEQ ID 

25 NO:15) and the predicted amino acid sequence of human TANGO 257 (SEQ ID NO:17). 
The open reading frame of SEQ ID NO: 16 extends from nucleotide 88 to nucleotide 1305 
of SEQ ID NO: 15 (SEQ ID NO: 16). 

FIGURE 10 depicts a hydropathy plot of human TANGO 257. Relatively 
hydrophobic regions of the protein are shown above the dashed horizontal line, and 

30 relatively hydrophilic regions of the protein are below the dashed horizontal line. The 
cysteine residues (cys) and potential N-glycosylation sites (Ngly) are indicated by short 
vertical lines just below the hydropathy trace. The dashed vertical line separates the signal 
sequence (amino acids 1 to 21 of SEQ ID NO: 16; SEQ ID NO: 19) on the left from the 
mature protein (amino acids 22 to 406 of SEQ ID NO: 16; SEQ ID NO: 18) on the right. 

35 Below the hydropathy plot, the amino acid sequence of human TANGO 257 is depicted. 
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FIGURES 1 1 A-l IB depict a cDNA sequence of mouse TANGO 257 (SEQ ID 
NO:21) and the predicted amino acid sequence of mouse TANGO 257 (SEQ ID NO:23). 
The open reading frame of SEQ ID NO:21 extends from nucleotide 31 to 1248 of SEQ ID 
NO:21(SEQIDNO:22). 

5 FIGURE 12 depicts a hydropathy plot of mouse TANGO 257. Relatively 

hydrophobic regions of the protein are shown above the dashed horizontal line, and 
relatively hydrophilic regions of the protein are below the dashed horizontal line. The 
cysteine residues (cys) and potential N-glycosylation sites (Ngly) are indicated by short 
vertical lines just below the hydropathy trace. The dashed vertical line separates the signal 

10 sequence (amino acids 1 to 21 of SEQ ID NO:23; SEQ ID NO:25) on the left from the 
mature protein (amino acids 22 to 406 of SEQ ID NO:23; SEQ ID NO:24) on the right. 
Below the hydropathy plot, the amino acid sequence of mouse TANGO 257 is depicted. 

FIGURE 13 depicts an alignment of the amino acid sequence of human TANGO 
257 (SEQ ID NO: 17) and the amino acid sequence of mouse TANGO 257 (SEQ ID 

1 5 NO:23). This alignment demonstrates that the amino acid sequences of human and mouse 
TANGO 257 are 94.1% identical. This alignment was performed using the ALIGN 
program with a PAM120 scoring matrix, a gap length penalty of 12 and a gap penalty of 4. 

FIGURE 14 depicts an alignment of the amino acid sequence (SEQ ID NO:43) 
encoded by a nucleotide sequence referred to in PCT publication WO 98/39446 as "gene 

20 64", and the amino acid sequence of human TANGO 257 (SEQ ID NO: 1 7). Gene 64 
encodes a 353 amino acid residue protein that exhibits homology with the human 
extracellular molecule olfactomedin, which is though to be involved in maintenance, 
growth and/or differentiation of chemosensory cilia on the apical dendrites of olfactory 
neurons. The polypeptide encoded by gene 64 also exhibits homology to human TANGO 

25 257, which contains 406 amino acids (i.e., an additional 53 amino acids carboxy to residue 
353). The amino acid sequences of amino acid residues 1-353 of the gene 64-encoded 
polypeptide and human TANGO 257 are identical. As such, the overall amino acid 
sequence identity between the full length polypeptide encoded by gene 64, and the full- 
length human TANGO 257 polypeptide is approximately 87%. This alignment was 

30 performed using the ALIGN alignment program with a PAM120 scoring matrix, a gap 
length penalty of 1 2, and a gap penalty of 4. 

FIGURES 15 A-l 5D depict an alignment of the nucleotide sequence of gene 64 
(SEQ ID NO:66; PCT Publication WO 98/39446) and the nucleotide sequence of human 
TANGO 257 (SEQ ID NO: 1 5). The nucleotide sequences of gene 64 and human 

35 TANGO 257 are 93.5% identical. It is noted, however, that among the differences 
between the two sequences is a cytosine nucleotide at human TANGO 257 (SEQ ID 
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NO: 15) position 1 146 that results in a human TANGO 257 amino acid sequence (SEQ ID 
NO:17) of 406 amino acids as opposed to the gene 64 amino acid sequence of only 353 
amino acids (SEQ ID NO:43). Alignment of the nucleotide sequence of the gene 64 open 
reading frame and that of human TANGO 257 (SEQ ID NO: 1 6) show that the two 
5 nucleotide sequences are 87.2% identical. These alignments were performed using the 
ALIGN program with a PAM220 scoring matrix, a gap length penalty of 12 and a gap 
penalty of 4. 

FIGURE 16 depicts an alignment of the acid sequence of the gene 64-encoded 
polypeptide (SEQ ID NO:43) and the amino acid sequence of mouse TANGO 257 (SEQ 
10 ID NO:23). The sequences exhibit an overall amino acid sequence identity of 

approximately 81.8%. This alignment was performed using an ALIGN program with a 
PAM120 scoring matrix, a gap length penalty of 12 and a gap penalty of 4. 

FIGURE 17A-17C depicts an alignment of the nucleotide sequence of gene 64 
(SEQ ID NO:66) and the nucleotide sequence of mouse TANGO 257 (SEQ ID NO:21). 

1 5 The two sequences are approximately 76.2% identical. Alignment of the nucleotide 
sequence of the gene 64 open reading frame and that of mouse TANGO 257 (SEQ ID 
NO:22) show that the two nucleotide sequences are 77.8% identical. These alignments 
were performed using the ALIGN program with a PAM220 scoring matrix, a gap length 
penalty of 12 and a gap penalty of 4. 

20 FIGURES 1 8A-1 8B depict the cDNA sequence of human INTERCEPT 258 (SEQ 

ID NO:26) and the predicted amino acid sequence of INTERCEPT 258 (SEQ ID NO:28). 
The open reading frame of SEQ ID NO:26 extends from nucleotide 153 to nucleotide 
1262 of SEQ ID NO:26 (SEQ ID NO:27). 

FIGURE 19 depicts a hydropathy plot of human INTERCEPT 258. Relatively 

25 hydrophobic regions of the protein are above the dashed horizontal line, and relatively 
hydrophilic regions of the protein are below the dashed horizontal line. The cysteine 
residues (Cys) and potential N-glycosylation sites (Ngly) are indicated by short vertical 
lines just below the hydropathy trace. Below the hydropathy plot, the amino acid 
sequence of human INTERCEPT 258 is depicted. 

30 FIGURES 20A-20B depict a cDNA sequence of mouse INTERCEPT 258 (SEQ ID 

NO:37) and the predicted amino acid sequence of mouse INTERCEPT 258 (SEQ ID 
NO:39). The open reading frame of SEQ ID NO:37 extends from nucleotide 107 TO 1288 
of SEQ ID NO:60 (SEQ ID NO:38). 

FIGURE 21 depicts a hydropathy plot of mouse INTERCEPT 258. Relatively 

35 hydrophobic regions of the protein are shown above the dashed horizontal line, and 
relatively hydrophilic regions of the protein are below the dashed horizontal line. The 
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cysteine residues (cys) and potential N-glycosyiation sites (Ngly) are indicated by short 
vertical lines just below the hydropathy trace. The dashed vertical line separates the signal 
sequence (amino acids 1 to 29 of SEQ ID NO:39; SEQ ID NO:41) on the left from the 
mature protein (amino acids 30 to 394 of SEQ ID NO:39; SEQ ID NO:40) on the right. 
5 Below the hydropathy plot, the amino acid sequence of mouse INTERCEPT 258 is 
depicted. 

FIGURE 22 depicts an alignment of the amino acid sequence of human 
INTERCEPT 258 (SEQ ID NO:28) and the amino acid sequence of mouse INTERCEPT 
258 (SEQ ID NO:39). The alignment demonstrates that the amino acid sequences of 
10 human and mouse INTERCEPT 258 are 62.8% identical. This alignment was performed 
using the ALIGN program with a PAM120 scoring matrix, a gap length penalty of 12 and 
a gap penalty of 4. 

FIGURE 23 depicts an alignment of the amino acid sequence of human A33 
antigen (SEQ ID NO:67; Swiss Prot Accession Number Q99795) and the amino acid 

1 5 sequence of human INTERCEPT 258 (SEQ ID NO:28). The A33 antigen is a 

transmembrane glycoprotein and member of the Ig superfamily that may be a cancer cell 
marker. The amino acid sequences of A33 antigen and human INTERCEPT 258 are 23% 
identical. This alignment was performed using the ALIGN alignment program with a 
PAM120 scoring matrix, a gap length penalty of 12, and a gap penalty of 4. 

20 FIGURES 24A-24D depict an alignment of the nucleotide sequence of human A33 

antigen (SEQ ID NO:68; Gen Bank Accession Number U79725) and the nucleotide 
sequence of human INTERCEPT 258 (SEQ ID NO:26). These two nucleotide sequences 
are 40.6% identical. The nucleotide sequence of the open reading frame of human A33 
antigen and that of human INTERCEPT 258 are 44% identical. These alignments were 

25 performed using the ALIGN alignment program with a PAM120 scoring matrix, a gap 
length penalty of 12, and a gap penalty of 4. 

FIGURE 25 depicts an alignment of the amino acid sequence of human A33 
antigen (SEQ ID NO:67; Swiss Prot Accession Number Q99795) and the amino acid 
sequence of mouse INTERCEPT 258 (SEQ ID NO:39). These two amino acid sequences 

30 have an overall amino acid identity of 23%. This alignment was performed using the 
ALIGN alignment program with a PAM120 scoring matrix, a gap length penalty of 12, 
and a gap penalty of 4. 

FIGURES 26A-26D depict an alignment of the nucleotide sequence of human A33 
antigen (SEQ ID NO:68; GenBank Accession Number U79725) and the nucleotide 

35 sequence of mouse INTERCEPT 258 (SEQ ID NO:37). These two nucleotide sequences 
are 40% identical. The nucleotide sequence of the open reading frame of human A33 
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antigen and that of mouse INTERCEPT 258 are 43.2% identical. These alignments were 
performed using the ALIGN alignment program with a PAM120 scoring matrix, a gap 
length penalty of 12, and a gap penalty of 4. 

FIGURE 27 A-27E depict an alignment of the nucleotide sequence of human 
5 PECAM-1 , an integrin expressed on endothelial cells (SEQ ID NO:72) and the nucleotide 
sequence of human INTERCEPT 258 (SEQ ED NO:26). These two nucleotide sequences 
are 40.5% identical. This alignment was performed using ALIGN alignment program 
with a PAM 1 20 scoring matrix, a gap length of 1 2, and a gap penalty of 4. 

FIGURE 28A-28B depict the cDNA sequence of human TANGO 281 (SEQ ID 
10 NO:46) and the predicted amino acid sequence of human TANGO 281 (SEQ ID NO:48). 
The open reading frame of SEQ ID NO:66 extends from nucleotide 65 to nucleotide 799 
of SEQ ID NO:46 (SEQ ED NO:47). 

FIGURE 29 depicts a hydropathy plot of human TANGO 281. Relatively 
hydrophobic regions of the protein are above the dashed horizontal line, and relatively 
15 hydrophilic regions of the protein are below the dashed horizontal line. The cysteine 
residues (cys) are indicated by short vertical lines just below the hydropathy trace. The 
dashed vertical line separates the signal sequence (amino acids 1 to 38 of SEQ ID NO:48; 
SEQ ID NO:49) on the left from the mature protein (amino acids 39 to 245 of SEQ ID 
NO:48; SEQ ID NO:50) on the right. Below the hydropathy plot, the amino acid sequence 
20 of human TANGO 28 1 is depicted. 

FIGURE 30 depicts an alignment of the amino acid sequence of photosystem II 10 
kD phosphoprotein domain (SEQ ED NO:69; GenBank Accession Number PF00737) and 
the amino acid sequence 97 to 146 of human TANGO 281 (SEQ ED NO:48). This 
alignment was performed using the ALIGN alignment program with a PAM 120 scoring 
25 matrix, a gap length penalty of 12, and a gap penalty of 4. 

FIGURES 31A-31B depict the cDNA sequence of mouse TANGO 281 (SEQ ID 
NO:56) and the predicted amino acid sequence of mouse TANGO 281 (SEQ ID NO:58). 
The open reading frame of SEQ ED NO:56 extends from nucleotide 90 to nucleotide 728 
of SEQ ED NO:56 (SEQ ED NO:57). 
30 Figure 32 depicts a hydropathy plot of mouse TANGO 281. Relatively 

hydrophobic regions of the protein are above the dashed horizontal line, and relatively 
hydrophilic regions of the protein are below the dashed horizontal line. The cysteine 
residues (cys) are indicated by short vertical lines just below the hydropathy trace. The 
dashed vertical line separates the signal sequence (amino acids 1 to 26 of SEQ ED NO:58; 
35 SEQ ED NO:59) on the left from the mature protein (amino acids 27 to 2 1 3 of SEQ ED 
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NO:58; SEQ ID NO:60) on the right. Below the hydropathy plot, the amino acid sequence 
of mouse TANGO 28 1 is depicted. 

FIGURE 33 depicts an alignment of the amino acid sequence of human TANGO 
281 (SEQ ID NO:48) and the amino acid sequence of mouse TANGO 281 (SEQ ID 
5 NO: 58). The alignment demonstrates that the amino acid sequences of human and mouse 
TANGO 281 are 66.5% identical. This alignment was performed using the ALIGN 
program with a PAM120 scoring matrix, a gap length penalty of 12 and a gap penalty of 4. 

Detailed Description of the Invention 

10 The TANGO 253, TANGO 257, INTERCEPT 258 and TANGO 281 proteins and 

nucleic acid molecules comprise families of molecules having certain conserved structural 
and functional features. As used herein, the terms "family" or "families" are intended to 
mean two or more proteins or nucleic acid molecules having a common structural domain 
and having sufficient amino acid or nucleotide sequence identity as defined herein. 

1 5 Family members can be from either the same or different species. For example, a family 
can comprises two or more proteins of human origin, or can comprise one or more 
proteins of human origin and one or more of non-human origin. Members of the sarnie 
family may also have common structural domains. 

For example, TANGO 253 proteins, TANGO 257 proteins, INTERCEPT 258 

20 proteins and TANGO 281 proteins of the invention have signal sequences. As used 
herein, a "signal sequence" includes a peptide of at least about 1 5 or 20 amino acid 
residues in length which occurs at the N-terminus of secretory and membrane-bound 
proteins and which contains at least about 70% hydrophobic amino acid residues such as 
alanine, leucine, isoleucine, phenylalanine, proline, tyrosine, tryptophan, or valine. In a 

25 preferred embodiment, a signal sequence contains at least about 10 to 40 amino acid 

residues, preferably about 19-34 amino acid residues, and has at least about 60-80%, more 
preferably 65-75%, and more preferably at least about 70% hydrophobic residues. A 
signal sequence serves to direct a protein containing such a sequence to a lipid bilayer. 
Thus, in one embodiment, a TANGO 253 protein contains a signal sequence of about 

30 amino acids 1 to 15 of SEQ ID NO:3 (SEQ ID NO:5) or about amino acids 1 to 15 of SEQ 
ID NO: 10 (SEQ ID NO: 12). In another embodiment, a TANGO 257 protein contains a 
signal sequence of about amino acids 1 to 21 of SEQ ID NO: 17 (SEQ ID NO: 19) or about 
amino acids 1 to 21 of SEQ ID NO:23 (SEQ ID NO:25). In another embodiment, an 
INTERCEPT 258 protein contains a signal sequence at about amino acids 1 to 29 of SEQ 

35 ID NO:28 (SEQ ID NO:30) or about amino acids 1 to 29 of SEQ ID NO:39 (SEQ ID 

NO:41). In yet another embodiment, a TANGO 281 protein contains a signal sequence of 
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about amino acids 1 to 38 of SEQ ID NO:48 (SEQ ID NO:49) or about amino acids 1 to 
26 of SEQ ID NO:58 (SEQ ID NO:59). The signal sequence is cleaved during processing 
of the mature protein. 

In one embodiment, TANGO 253 includes at least one RGD cell attachment site. 

5 An RGD domain contains a contiguous arginine-glycine-aspartic acid amino acid 
sequence and is involved in cell-cell, cell-extracellular matrix and cell adhesion 
interactions. In a preferred embodiment, a TANGO 253 family member has the amino 
acid sequence of SEQ ID NO:3 and, preferably, a RGD cell attachment site is located at 
about amino acid positions 77 to 79. 

10 TANGO 253 family members can also include a collagen domain. As used herein, 

the term "collagen domain" refers to a protein domain containing a G-X-Y amino acid 
repeat motif, wherein the first amino acid residue is glycine and the second and third 
amino acid residues can be any residue but are preferably proline or hydroxyproline. 
Typically, a collagen domain contains at least about 3 to 5 G-X-Y repeats, and can contain 

15 about 3, 5, 8, 10, 12, 15, 20 or more continuous G-X-Y repeats. In one embodiment, a 
collagen domain can fold to form a triple helical structure. 

In one embodiment, a TANGO 253 family member includes at least one collagen 
domain having an amino acid sequence that is at least about 40%, 50%, 60%, 70%, 80%, 
90%, 95% or 98% identical to amino acids 36 to 95 of SEQ ID NO:3, which is the 

20 collagen domain of human TANGO 253 (SEQ ID NO:6), or amino acids 36 to 95 of SEQ 
ID NO:10, which is the collagen domain of mouse TANGO 253 (SEQ ID NO:13), while 
maintaining a glycine residue at the first position of G-X-Y repeats within the domain to 
maintain at least 3, 5, 8, 10, 12, 15 or 20 contiguous G-X-Y repeats, or while most 
preferably maintaining a glycine repeat at the first position of each G-X-Y repeat within 

25 the domain. 

TANGO 253 family members can also include a Clq domain or at least one of the 
conserved amino acid motifs found therein. As used herein, the term "Clq domain" refers 
to a protein domain that bears homology to a Clq domain present within a member of the 
CI enzyme complex. A Clq domain typically includes about 130-140 amino acid 

30 residues. C 1 q domains are utilized in processes involving, e.g. , correct protein folding and 
alignment and protein-protein interactions. 

In one embodiment, a TANGO 253 family member includes one or more Clq 
domains having an amino acid sequence that is at least 45%, preferably about 50%, 55%, 
60%, 70%, 75%, 80%, 90%, 95% and most preferably at least about 98% identical to 

35 amino acids 105 to 232 of SEQ ID NO:3, which is the human TANGO 253 Clq domain 



-27- 



WO 00/78808 



PCT/US00/16883 



(SEQ ED NO:7) or amino acids 105 to 232 of SEQ ID NO: 10, which is the mouse 
TANGO 253 Clq domain (SEQ ID NO:14). 

Embodiments of TANGO 253 family members include, but are not limited to, 
human, mouse and rat TANGO 253 nucleic acids and proteins. The features of the human 

5 and mouse TANGO 253 are described below. A cDNA encoding a rat TANGO 253 
nucleotide sequence (SEQ ID NO:74), identified in clone jtrxaOOl el Otl, is 75.4% 
identical to human TANGO 253 (SEQ ID NO:l) in a 536 bp overlap. Further, the isolated 
rat TANGO 253 nucleotide sequence (SEQ ID NO:74) is 86% identical to mouse TANGO 
253 (SEQ ID NO:9) in a 472 bp overlap. 

1 0 Embodiments of TANGO 257 family members include, but are not limited to, 

human, mouse and rat TANGO 257 nucleic acids and proteins. The features of the human 
and mouse TANGO 257 are described below. A cDNA encoding a rat TANGO 257 
nucleotide sequence (SEQ ID NO:75), identified within clone jtrxal02g06tl, is 83.8% 
identical to human TANGO 257 (SEQ ED NO: 1 5) in a 734 bp overlap. Further, the 

1 5 isolated rat TANGO 257 nucleotide sequence (SEQ ED NO:75) is 88.4% identical to 
mouse TANGO 257 (SEQ ED NO:21) in a 73 1 bp overlap. 

In one example, a TANGO 257 family member includes one or more of the 
following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3) a 
cytoplasmic domain. In one embodiment, a TANGO 257 protein contains cytoplasmic 

20 domains of about amino residues 1 to 202 of SEQ ED NO: 1 7 (SEQ ED NO:84) and about 
amino acid residues 338 to 406 of SEQ ED NO: 17 (SEQ ID NO:92), transmembrane 
domains of about amino acid residues 203 to 221 of SEQ ID NO:17 (SEQ ID NO:86) and 
about amino acid residues 321 to 337 of SEQ ED NO: 17 (SEQ ED NO:87), and an 
extracellular domain of about amino acid residues 222 to 320 of SEQ ID NO: 17 (SEQ ID 

25 NO:88). In an alternative embodiment, a TANGO 257 protein contains an extracellular 
domain of about amino acid residues 1 to 320 of SEQ ID NO: 1 7 (SEQ ID NO:89) or a 
mature extracellular domain of about amino acid residues 22 to 320 of SEQ ED NO: 17 
(SEQ ID NO:90), a transmembrane domain of about amino acid residues 321 to 337 of 
SEQ ED NO: 17 (SEQ ID NO:87), and a cytoplasmic domain of about amino acid residues 

30 338 to 406 of SEQ ED NO: 1 7 (SEQ ID NO:92). In another embodiment, a mature 
TANGO 257 protein contains about amino acid residues 22 to 406 of SEQ ID NO: 17 
(SEQ ED NO: 18). 

In another embodiment, a TANGO 257 protein contains intracellular domains of 
about amino acid residues 1 to 202 of SEQ ID NO:23 (SEQ ED NO:93) and about amino 
35 acid residues 338 to 406 of SEQ ID NO:23 (SEQ ED NO:94), transmembrane domains of 
about amino acid residues 203 to 221 of SEQ ED NO:23 (SEQ ED NO:95) and about 
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amino acid residues 32 1 to 337 of SEQ ID NO:32 (SEQ ID NO:96), and an extracellular 
domain of about amino acid residues 222 to 320 of SEQ ID NO:23 (SEQ ID NO:97). In 
alternative embodiment, a TANGO 257 protein contains an extracellular domain of about 
amino acid residues 1 to 320 of SEQ ID NO:23 (SEQ ID NO:98) or a mature extracellular 

5 domain of about amino acid residues 22 to 320 of SEQ ID NO:23 (SEQ ID NO:99), a 
transmembrane domain of about amino acid residues 321 to 337 of SEQ ID NO:25 (SEQ 
ID NO:96), and an intracellular domain of about amino acid residues 338 to 406 of SEQ 
ID NO:23 (SEQ ID NO:94). In another embodiment, a mature TANGO 257 protein 
contains about amino acid residues 22 to 406 of SEQ ID NO:23 (SEQ ID NO:24). 

1 0 In another example, an INTERCEPT 258 family member includes one or more of 

the following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3) 
a cytoplasmic domain. Thus, in one embodiment, an INTERCEPT 258 protein contains 
extracellular domains of about amino acid residues 1 to 206 of SEQ ID NO:28 (SEQ ID 
NO:81) or about amino acid residues 30 to 206 of SEQ ED NO: 28 (SEQ ID NO:76) and 

1 5 about amino acid residues 272 to 370 of SEQ ID NO: 28 (SEQ ID NO:34), 

transmembrane domains of about amino acid residues 207 to 224 of SEQ ID NO:28 (SEQ 
ID NO:78) and about amino acid residues 247 to 271 of SEQ ID NO:28 (SEQ ID NO:33), 
and a cytoplasmic domain of about amino acid residues 225 to 246 of SEQ ID NO:28 
(SEQ ID NO:79). In an alternative embodiment, an INTERCEPT 258 protein contains an 

20 extracellular domain of about amino acid residues 272 to 370 of SEQ ID NO:28 (SEQ ID 
NO:34), a transmembrane domain of about amino acid residues 247 to 271 of SEQ ID 
NO:28 (SEQ ID NO:33), and a cytoplasmic domain of about amino acid residues 1 to 246 
of SEQ ID NO:28 (SEQ ID NO:31) or a mature cytoplasmic domain of about amino acid 
residues 30 to 246 of SEQ ID NO:28 (SEQ ID NO:82). In accordance with these 

25 embodiments, an INTERCEPT 258 protein is a mature protein containing an extracellular, 
transmembrane and cytoplasmic domain of about amino acids 30 to 370 of SEQ ID NO:28 
(SEQEDNO:29). 

In another embodiment, an INTERCEPT 258 protein contains an extracellular 
domain of about amino acids 1 to 249 of SEQ ID NO:39 (SEQ ID NO:42), or a mature 

30 extracellular domain of about amino acids 30 to 249 of SEQ ID NO:39 (SEQ ID NO:83). 
In another embodiment, an INTERCEPT 258 protein contains a transmembrane domain of 
about amino acids 250 to 274 of SEQ ID NO:39 (SEQ ID NO:44). In another 
embodiment, an INTERCEPT 258 protein contains a cytoplasmic domain of about amino 
acids 275 to 394 of SEQ ID NO:39 (SEQ ID NO:45). In accordance with these 

35 embodiments, an INTERCEPT 258 protein is a mature protein containing an extracellular, 
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transmembrane and cytoplasmic domain of about 30 to 394 of SEQ ID NO:39 (SEQ ID 
NO:40). 

INTERCEPT 258 family members can also include an immunoglobulin (Ig) 
domain contained within the extracellular domain. As used herein, the term "Ig domain" 

5 refers to a protein domain bearing homology to immunoglobulin superfamily members. 
An Ig domain includes about 30-90 amino acid residues, preferably about 40-80 amino 
acid residues, more preferably about 50-70 amino acid residues, still more preferably 
about 55-65 amino acid residues, and most preferably about 57 to 59 amino acid residues. 
In certain embodiments, an Ig domain contains a conserved cysteine residue within about 

10 5 to 15 amino acid residues, preferably about 7 to 12 amino acid residues, and most 
preferably about 8 amino acid residues from its N-terminal end, and another conserved 
cysteine residue within about 1 to 5 amino acid residues, preferably about 2 to 4 amino 
acid residues, and most preferably about 3 amino acid residues from its C-terminal end. 
An Ig domain typically has the following consensus sequence, beginning about 1 

15 to 15 amino acid residues, more preferably about 3 to 10 amino acid residues, and most 
preferably about 5 amino acid residues from the C terminal end of the domain: (FY)-Xaa- 
C-Xaa-(VA)-COO-, wherein (FY) is either a phenylalanine or a tyrosine residue 
(preferably tyrosine), where "Xaa" is any amino acid, C is a cysteine residue, (VA) is 
either a valine or an alanine residue (preferably alanine), and COO- is the protein C 

20 terminus. 

In one embodiment, an INTERCEPT 258 family member includes one or more Ig 
domains having an amino acid sequence that is at least about 55%, preferably at least 
about 65%, more preferably at least 75%, yet more preferably at least about 85%, and 
most preferably at least about 95% identical to amino acids 49 to 128 and/or amino acids 

25 1 67 to 226 of SEQ ID NO:28, which are the Ig domains of human INTERCEPT 258 
(these Ig domains are also represented as SEQ ID NO:35 and 36, respectively). 

In another embodiment, an INTERCEPT 258 family member includes one or more 
Ig domains having an amino acid sequence that is at least about 55%, preferably at least 
about 65%, more preferably at least about 75%, yet more preferably at least about 85%, 

30 and most preferably at least about 95% identical to amino acids 167 to 226 of SEQ ID 
NO:28 (SEQ ID NO:36), includes a conserved cysteine residue about 8 residues 
downstream from the N-terminus of the Ig domain, and has one or more Ig domain 
consensus sequences described herein. In another embodiment, an INTERCEPT 258 
family member includes one or more Ig domains having an amino acid sequence that is at 

35 least 55%, preferably at least about 65%, more preferably at least about 75%, yet more 
preferably at least about 85%, and most preferably at least about 95% identical to amino 
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acids 167 to 226 of SEQ ID NO:28 (SEQ ID NO:36), includes a conserved cysteine 
residue 8 residues downstream from the N-terminus of the Ig domain, has one or more Ig 
domain consensus sequences described herein, and has a conserved cysteine within the 
consensus sequence that forms a disulfide both with said first conserved cysteine. In yet 

5 another embodiment, an INTERCEPT 258 family member includes one or more Ig 

domains having an amino acid sequence that is at least 55%, preferably at least about 65%, 
more preferably at least about 75%, yet more preferably at least about 85%, and most 
preferably at least about 95% identical to amino acids 167 to 226 of SEQ ID NO:28 (SEQ 
ID NO:36), includes a conserved cysteine residue 8 residues downstream from the N- 

1 0 terminus of the Ig domain, has one or more Ig domain consensus sequences described 
herein, has a conserved cysteine within the consensus sequence that forms a disulfide both 
with said first conserved cysteine, and has at least one INTERCEPT 258 biological 
activity as described herein. 

In a preferred embodiment, an INTERCEPT 258 family member has the amino 

1 5 acid sequence of SEQ ID NO:28 wherein the aforementioned Ig conserved residues are 
located as follows: the N-terminal conserved cysteine residue is located at about amino 
acid position 174 (within the Ig domain SEQ ID NO:36) and the C-terminal conserved 
cysteine is located at about amino acid position 224 (within the Ig domain SEQ ID 
NO:36). 

20 In another embodiment, an INTERCEPT 258 family member includes one or more 

Ig domains having an amino acid sequence that is at least about 55%, preferably at least 
about 65%, more preferably at least about 75%, yet more preferably at least about 85%, 
and most preferably at least about 95% identical to amino acids 170 to 229 of SEQ ID 
NO:39, which is the Ig domain of mouse INTERCEPT 258 (SEQ ID NO:71). In another 

25 embodiment, an INTERCEPT 258 family member includes one or more Ig domains 
having an amino acid sequence that is at least about 55%, preferably at least about 65%, 
more preferably at least about 75%, yet more preferably at least about 85%, and most 
preferably at least about 95% identical to amino acids 170 to 229 of SEQ ID NO:39 (SEQ 
ID NO:71), includes a conserved cysteine residue about 8 residues downstream from the 

30 N-terminus of the Ig domain, and has one or more Ig domain consensus sequences 
described herein, has a conserved cysteine within the consensus sequence that forms a 
disulfide both with said first conserved cysteine, and has at least one INTERCEPT 258 
biological activity as described herein. 

In a preferred embodiment, an INTERCEPT 258 family member has the amino 

35 acid sequence of SEQ ID NO:39 wherein the aforementioned Ig domain conserved 
residues are located as follows: the N-terminal conserved cysteine residue is located at 
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about amino acid residue position 177 (within the Ig domain SEQ ID NO:71) and the C- 
terminal conserved cysteine residue is located at about amino acid position 227 (within the 
Ig domain SEQflDNO:71). 

In another example, a TANGO 28 1 family member consists of one or more of the 

5 following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3) a 
cytoplasmic domain. In one embodiment, a TANGO 281 protein contains an extracellular 
domain at amino acids 1 to about 123 of SEQ ID NO:48 or a mature extracellular domain 
at about amino acid residues 39 to 123 of SEQ ID NO:48 (SEQ ID NO:51), a 
transmembrane domain at about amino acid residues 124 to 148 of SEQ ID NO:48 (SEQ 

10 ID NO:52), and a cytoplasmic domain at about amino acid residues 149 to 245 of SEQ ID 
NO:48 (SEQ ID NO:53). In another embodiment, a mature TANGO 281 protein contains 
about amino acid residues 39 to 245 of SEQ ID NO: 48 (SEQ ID NO: 50). In another 
embodiment, a TANGO 281 family contains an extracellular domain at amino acids 1 to 
about 1 12 of SEQ ID NO:58 or a mature extracellular domain at about amino acid residues 

15 27 to 1 12 of SEQ ID NO:58 (SEQ ID NO:61), a transmembrane domain at about amino 
acid residues 1 13 to 137 of SEQ ID NO:78 (SEQ ID NO:62), and a cytoplasmic domain at 
about amino acid residues 138 to 213 of SEQ ID NO:78 (SEQ ID NO:63). In yet another 
embodiment, a mature TANGO 281 protein contains about amino acid residues 27 to 213 
of SEQ ID NO: 58 (SEQ ID NO: 61). 

20 In one embodiment, a TANGO 281 family member includes a signal sequence. In 

a preferred embodiment, a TANGO 281 family member has the amino acid sequence of 
SEQ ID NO:48, and the signal sequence is located at about amino acids 1 to 38. In an 
another preferred embodiment, a TANGO 281 family member has the amino acid 
sequence of SEQ ID NO:58, and the signal sequence is located at about amino acids 1 to 

25 26. 

A photosystem II lOkd phosphoprotein (PSBH) domain has been identified in the 
TANGO 28 1 proteins. The domain is also present in the chloroplast gene PSBH that 
encodes a 9-10kDa thylakoid membrane protein (PSH-H) which is associated with 
photosystem II. In one embodiment, a TANGO 281 family member includes one or more 

30 PSBH domains having an amino acid sequence that is at least about 55%, preferably at 
least about 65%, more preferably at least 75%, yet more preferably at least about 85%, and 
most preferably at least about 95% identical to amino acids 41 to 90 and/or amino acids 
127 to 182 of SEQ ID NO:48, which are the PSBH domains of human TANGO 281 (these 
PSBH domains are also represented as SEQ ID NO:54 and 55, respectively). In another 

35 embodiment, a TANGO 281 family member includes one or more PSBH domains having 
an amino acid sequence that is at least about 55%, preferably at least about 65%, more 
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preferably at least about 75%, yet more preferably at least about 85%, and most preferably 
at least about 95% identical to amino acids 41 to 90 and/or amino acids 127 to 182 of SEQ 
ID NO:48, which are the PSBH domains of human TANGO 281 (these PSBH domains are 
also represented as SEQ ID NO:54 and 55, respectively), includes one or more PSBH 

5 domain consensus sequences described herein, and has at least one TANGO 28 1 
biological activity as described herein. 

In another embodiment, a TANGO 281 family member includes one or more 
PSBH domains having an amino acid sequence that is at least about 55%, preferably at 
least about 65%, more preferably at least 75%, yet more preferably at least about 85%, and 

1 0 most preferably at least about 95% to 98% identical to amino acids 42 to 91 and/or amino 
acids 128 to 183 of SEQ ID NO:58, which are the PSBH domains of mouse TANGO 281 
(these PSBH domains are also represented as SEQ ID NO:64 and 65, respectively). In 
another embodiment, a TANGO 281 family member includes one or more PSBH domains 
having an amino acid sequence that is at least about 55%, preferably at least about 65%, 

1 5 more preferably at least about 75%, yet more preferably at least about 85%, and most 
preferably at least about 95% identical to amino acids 42 to 91 and/or amino acids 128 to 
183 of SEQ ID NO:58, which are the PSBH domains of mouse TANGO 281 (these PSBH 
domains are also represented as SEQ ID NO:64 and 65, respectively), includes one or 
more PSBH domain consensus sequences described herein, and has at least one TANGO 

20 281 biological activity as described herein. 

Various features of human and mouse TANGO 253, TANGO 257, INTERCEPT 
258 and TANGO 281 are summarized below. 

Human TANGO 253 

25 A cDNA encoding human TANGO 253 was identified by analyzing the sequences 

of clones present in a coronary artery smooth muscle library for sequences that encode 
secreted proteins. The primary cells utilized in construction of the library had been 
stimulated with agents that included phorbol 12-myristate 13-acetate (PMA), tumor 
neurosis factor (TNF), ionomycin, and cyclohexamide (CHX). This analysis led to the 

30 identification of a clone, Athma27h9, encoding full-length human TANGO 253. The 
human TANGO 253 cDNA of this clone is 1339 nucleotides long (Figures 1A-1B; SEQ 
ID NO:l). The open reading frame of this cDNA, nucleotides 188 to 916 of SEQ ID NO:l 
(SEQ ID NO:2), encodes a 243 amino acid secreted protein (Figures 1 A- IB; SEQ ID 
NO:3). 

35 Figure 2 depicts a hydropathy plot of human TANGO 253. Relatively 

hydrophobic regions of the protein are shown above the horizontal line, and relatively 
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hydrophilic regions of the protein are below the horizontal line. The cysteine residues 
(cys) are indicated by short vertical lines just below the hydropathy trace. The dashed 
vertical line separates the signal sequence (amino acids 1 to 15 of SEQ ID NO:3; SEQ ID 
NO:5) on the left from the mature protein (amino acids 15 to 243 of SEQ ID NO:3; SEQ 

5 ID NO:4) on the right. 

The signal peptide prediction program SIGNALP (Nielsen et al., 1997, Protein 
Engineering 10:1-6) predicted that human TANGO 253 includes a 15 amino acid signal 
peptide (amino acid 1 to amino acid 15 of SEQ ID NO:3; SEQ ID NO:5) preceding the 
mature human TANGO 253 protein (corresponding to amino acid 16 to amino acid 243 of 

1 0 SEQ ID NO:3; SEQ ID NO:4). The molecular weight of TANGO 253 protein without 
post-translational modifications is 25.3 kDa prior to the cleavage of the signal peptide, 
23.8 kDa after cleavage of the signal peptide. 

Human TANGO 253 includes a collagen domain (at about amino acids 36 to 95 of 
SEQ ID NO:3; SEQ ED NO:6) and a Clq domain (at about amino acids 105 to 232 of SEQ 

1 5 ID NO:3; SEQ ID NO:7) containing 23 G-X-Y repeats. An RGD cell attachment site is 
found at amino acids 77 to 79 of SEQ ID NO:3. 

Three protein kinase C phosphorylation sites are present in human TANGO 253. 
The first has the sequence SAK (at amino acids 107 to 109 of SEQ ID NO:3), the second 
has the sequence TGK (at amino acids 140 to 142 of SEQ ID NO:3), and the third has the 

20 sequence SEC (at amino acids 220 to 222 of SEQ ID NO:3). Human TANGO 253 has 
three N-myristylation sites. The first has the sequence GLAAGS (at amino acids 1 1 to 16 
of SEQ ID NO:3), the second has the sequence GGRPGL (at amino acids 68 to 73 of SEQ 
ID NO:3) and the third has the sequence GIYASI (at amino acids 216 to 221 of SEQ ID 
NO:3). 

25 Northern analysis of human TANGO 253 expression demonstrates strong 

expression in heart, lung, liver, kidney and pancreas, and moderate expression in brain, 
placenta and skeletal muscle. Liver expression reveals two human TANGO mRNA bands, 
one of approximately 1.3kb (which is the size observed in the other tissues) as well as a 
band at approximately lkb, which may be the result of an alternative splicing event. 

30 Secretion assays reveal a human TANGO 253 protein of approximately 30kDa. 

The secretion assays were performed as follows: 8xl0 5 293T cells were plated per well in 
a 6-well plate and the cells were incubated in growth medium (DMEM, 10% fetal bovine 
serum, penicillin/strepomycin) at 37°C, 5% C0 2 overnight. 293T cells were transfected 
with 2 \ig of full-length TANGO 253 inserted in the pMET7 vector/well and 10 ng 

35 LipofectAMINE (GIBCO/BRL Cat. # 1 8324-012) /well according to the protocol for 
GIBCO/BRL LipofectAMINE. The transfectant was removed 5 hours later and fresh 
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growth medium was added to allow the cells to recover overnight. The medium was 
removed and each well was gently washed twice with DMEM without methionine and 
cysteine (ICN Cat. # 16-424-54). 1 ml DMEM without methionine and cysteine with 50 
uCi Trans- 35 S (ICN Cat. # 51006) was added to each well and the cells were incubated at 

5 37 °C, 5% C0 2 for the appropriate time period. A 150 ul aliquot of conditioned medium 
was obtained and 150 ul of 2X SDS sample buffer was added to the aliquot. The sample 
was heat-inactivated and loaded on a 4-20% SDS-PAGE gel. The gel was fixed and the 
presence of secreted protein was detected by autoradiography. 

TANGO 253 exhibits homology to an adipocyte complement-mediated protein 

10 precursor and so may be involved in adipocyte function, e.g., may act as a signaling 

molecule for adipocyte tissue. Figure 6A shows an alignment of the human TANGO 253 
amino acid sequence (SEQ ID NO:3) with the human adipocyte complement-mediated 
protein precursor amino acid sequence (SEQ ID NO:20). The alignment shows that there 
is a 38.7% overall amino acid sequence identity between human TANGO 253 and human 

1 5 adipocyte complement-mediated protein precursor. 

Figures 7A-7C shows an alignment of the nucleotide sequence of human adipocyte 
complement-mediated protein precursor nucleotide sequence (SEQ ID NO: 32); GenBank 
Accession Number A141 7523) and the nucleotide sequence of human TANGO 253 (SEQ 
ID NO:i). The alignment shows a 29.1% overall sequence identity between the two 

20 nucleotide sequences. 

The human TANGO 253 nucleotide sequence was mapped to human chromosome 
11, between flanking markers Dl 1S1356 and Dl 1S924 using the Genebridge 4 Human 
Radiation hybrid mapping panel with CAAAGTGAGCTCATGCTCTCAC (SEQ ID 
NO: 193) as the forward primer and CTCTGGTCTTGGGCAGAAATC (SEQ ID NO: 194) 

25 as the reverse primer. 

Clone EpT253, which encodes human TANGO 253, was deposited with the 
American Type Culture Collection (10801 University Boulevard, Manassas, VA 201 10- 
2209) on April 21, 1999 and assigned Accession Number 207222. This deposit will be 
maintained under the terms of the Budapest Treaty on the International Recognition of the 

30 Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made 
merely as a convenience for those of skill in the art and is not an admission that a deposit 
is required under 35 U.S.C. § 1 1 2. 

Mouse TANGO 253 

35 A cDNA encoding mouse TANGO 253 was identified by analyzing the sequences 

of clones present in a mouse microglia library using a rat TANGO 253 probe from sciatic 
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nerve. This analysis led to the identification of a clone, AtmXalel075, encoding full- 
length mouse TANGO 253. The mouse TANGO 253 cDNA of this clone is 1263 
nucleotides long (Figures 3A-3B; SEQ ID NO:8). The open reading frame of this cDNA, 
nucleotides 135 to 863 of SEQ ED NO:8 (SEQ ID NO:9), encodes a 243 amino acid 

5 secreted protein (Figures 3 A-3B; SEQ ID NO: 1 0). 

Figure 4 depicts a hydropathy plot of mouse TANGO 253. Relatively hydrophobic 
regions of the protein are shown above the horizontal line, and relatively hydrophilic 
regions of the protein are below the horizontal line. The cysteine residues (cys) are 
indicated by short vertical lines just below the hydropathy trace. The dashed vertical line 

1 0 separates the signal sequence (amino acid 1 to amino acid 1 5 of SEQ ID NO: 1 0; SEQ ID 
NO: 12) on the left from the mature protein (amino acid 16 to amino acid 243 of SEQ ID 
NO:10; SEQ ID NO:l 1) on the right. 

the signal peptide prediction program SIGNALP (Nielsen et al., 1 997, Protein 
Engineering 10:1-6) predicted that mouse TANGO 253 includes a 15 amino acid signal 

1 5 peptide (amino acid 1 to amino acid 1 5 of SEQ ID NO: 1 0; SEQ ID NO: 1 2) preceding the 
mature mouse TANGO 253 protein (corresponding to amino acid 16 to amino acid 243 of 
SEQ ID NO:10; SEQ ID NO:l 1). The molecular weight of mouse TANGO 253 protein 
without post-translational modifications is 25.4 kDa prior to the cleavage of the signal 
peptide, 23.9 kDa after cleavage of the signal peptide. 

20 Mouse TANGO 253 includes a collagen domain (at amino acids 36 to 95 of SEQ 

ID NO:10; SEQ ID NO:13) and a Clq domain (at amino acids 105-232 of SEQ ID NO:10; 
SEQ ID NO: 14). 

Three protein kinase C phosphorylation sites are present in mouse TANGO 253. 
The first has the sequence SAK (at amino acids 107 to 109 of SEQ ID NO: 10), the second 

25 has the sequence TGK (at amino acids 1 40 to 1 42 of SEQ ID NO: 1 0), and the third has the 
sequence SIK (at amino acids 220 to 222 of SEQ ID NO: 10). Mouse TANGO 253 has 
four N-myristylation sites. The first has the sequence GLVSGS (at amino acids 1 1 to 1 6 
of SEQ ID NO: 10), the second has the sequence GGRPGL (at amino acids 68 to 73 of 
SEQ ID NO:10), the third has the sequence GQSIAS (at amino acids 172 to 177 of SEQ 

30 ID NO: 1 0), and the fourth has the sequence GIYASI (at amino acids 2 1 6 to 22 1 of SEQ 
ID NO: 10). 

As shown in Figure 5, human TANGO 253 protein and mouse TANGO 253 
protein are 93.8% identical. Figure 6B shows an alignment of the mouse TANGO 253 
amino acid sequence (SEQ ED NO: 10) with the human adipocyte complement-mediated 
35 protein precursor amino acid sequence (SEQ ID NO:20). The alignment shows that there 
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is a 38.3% overall amino acid sequence identity between mouse TANGO 253 and human 
adipocyte complement-mediated protein precursor. 

Figures 8A-8C shows an alignment of the nucleotide sequence of human adipocyte 
complement-mediated protein precursor nucleotide sequence (SEQ ID NO:32); GenBank 

5 Accession Number A141 7523) and the nucleotide sequence of mouse TANGO 253 (SEQ 
ID NO:8). The alignment shows a 30.4% overall sequence identity between the two 
nucleotide sequences. 

In situ tissue screening was performed on mouse embryonic tissue (obtained from 
embryos at embryonic day 13.5 to postnatal day 1.5) and adult tissue to determine the 

1 0 expression of mouse TANGO 253 mRNA. Expression of mouse TANGO 253 during 
embryogenesis was ubiquitously expressed throughout the central nervous system. Strong 
expression of mouse TANGO 253 was detected in choriod plexus of the fourth ventricle 
of El 8.5 and El. 5 embryos examined. Expression of mouse TANGO 253 was also 
detected in the lungs of E14.5 and E15.5 embryos and in the kidneys of E15.5 embryos. 

1 5 Mouse TANGO 253 expression was detected by in situ hybridization in the 

following adult tissues: a signal was detected in the brain in the choroid plexus of the 
lateral and 4th ventricles, and the olfactory bulb; a signal was detected in the cortical 
region of the kidney consistent with the pattern of glomeruli (in particular, the cortical 
radial veins); a ubiquitous signal was detected in the thymus; a weak, ubiquitous signal 

20 was detected in the spleen; a moderate signal was associated with the seminiferous 
vesicles of the testes; a signal was detected in the ovaries; and a ubiquitous signal 
restricted to the zone of giant cells was detected in the placenta. 

Clone EpTm253, which encodes mouse TANGO 253, was deposited with the 
American Type Culture Collection (10801 University Boulevard, Manassas, VA 201 10- 

25 2209) on April 2 1 , 1 999 and assigned Accession Number 2072 1 5. This deposit will be 
maintained under the terms of the Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made 
merely as a convenience for those of skill in the art and is not an admission that a deposit 
is required under 35 U.S.C. §112. 

30 

Uses of TA NGO 253 Nucleic acids. Polypeptides, and Modulators Thereof 

As TANGO 253 was originally found in the coronary artery smooth muscle library 
described above, TANGO 253 nucleic acids, proteins, and modulators thereof can be used 
to modulate the proliferation, development, differentiation, and/or function of organs, e.g., 
35 tissues and cells that form blood vessels and coronary tissue, e.g., cells of the coronary 
connective tissue, e.g., abnormal coronary smooth muscle cells and/or endothelial cells of 
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blood vessels. TANGO 253 nucleic acids, proteins, and modulators thereof can also be 
used to modulate symptoms associated with abnormal coronary function, e.g., heart 
diseases and disorders such as atherosclerosis, coronary artery disease and plaque 
formation. 

5 In light of the collagen domain, TANGO 253 nucleic acids, proteins and 

modulators thereof can be utilized to modulate {e.g., stabilize, promote, inhibit or disrupt) 
cell/extracellular matrix (ECM) interactions, cell/cell interactions and, for example, signal 
transduction events associated with such interactions. For example, such TANGO 253 
compositions and modulators thereof can be used to modulate binding of such ECM- 

1 0 associated factors as integrin and can function to modulate ligand binding to cell surface 
receptors. In addition, TANGO 253 nucleic acids, proteins and modulators thereof can be 
utilized to modulate connective tissue formation, maintenance and function, as well as to 
modulate symptoms associated with connective tissue-related disorders, to promote wound 
healing, and to reduce, slow or inhibit ameliorate connective tissue-related signs of aging, 

1 5 such as wrinkle formation. 

In light of the Clq domain exhibited by TANGO 253 proteins and their similarity 
to the collectin family, TANGO 253 nucleic acids, proteins and modulators thereof can be 
utilized to modulate immune-related processes such as the ability to modulate host 
immune response by, e.g., modulating one or more elements in the serum complement 

20 cascade, including, for example activation of the cascade, formation of and/or binding to 
immune complexes, detection and defense against surface antigens and bacteria, and 
immune surveillance for rapid removal or pathogens. Such TANGO 253 compositions 
and modulators thereof can be utilized, e.g., to ameliorate incidence of any symptoms 
associated with disorders that involve such immune-related processes, including, but not 

25 limited to infection and autoimmune disorders. 

In addition, such compositions and modulators thereof can be utilized to modulate 
folding and alignment of the collagen domain (e.g., into a triple helix), disorders 
associated with collagen defects, including but not limited to bone disorders, e.g., bone 
resorption disorders, or hearing, e.g., inner ear, disorders, to modulate protein-protein 

30 interactions and recognition events (either homotypic or heterotypic) and cellular response 
events (e.g., signal transduction events) associated with such interactions and recognitions, 
and to ameliorate symptoms associated with abnormal signaling, protein-protein 
interaction and/or cellular response events including, but not limited to cell proliferation 
disorders such as cancer, abnormal neuronal interactions, such as disorders involving 

35 abnormal synaptic activity, e.g., abnormal Purkinje cell activities. 
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Human TANGO 253 protein contains an RGD domain. As such, TANGO 253 
nucleic acids, proteins and modulators thereof can be utilized to modulate processes 
involved in, e.g., bone development, sepsis, tumor progression, metastasis, cell migration, 
fertilization, and cellular interactions with the extracellular matrix required for growth, 
5 differentiation, and apoptosis, as well as cellular processes involving cell adhesion, such as 
cell migration. 

TANGO 253 proteins exhibit similarity to adipocyte complement-related protein 
precursor and can act as signaling molecules for adipocyte tissue. In light of this, TANGO 
253 nucleic acids, proteins and modulators thereof can be utilized to modulate adipocyte 
10 function and adipocyte-related processes and disorders such as, e.g., obesity. 

TANGO 253 nucleic acids, proteins, and modulators thereof can also be utilized to 
modulate the development, differentiation, maturation, proliferation and/or activity of cells 
of the central nervous system such as neurons, glial cells (e.g., astrocytes and 
oligodendrocytes), and Schwann cells. TANGO 253 nucleic acids, polypeptides, or 

1 5 modulators thereof can also be used to treat disorders of the brain, such as cerebral edema, 
hydrocephalus, brain herniations, iatrogenic disease (due to, e.g., infection, toxins, or 
drugs), inflammations (e.g., bacterial and viral meningitis, encephalitis, and cerebral 
toxoplasmosis), cerebrovascular diseases (e.g., hypoxia, ischemia, and infarction, 
intracranial hemorrhage and vascular malformations, and hypertensive encephalopathy), 

20 tumors (e.g., neuroglial tumors, neuronal tumors, tumors of pineal cells, meningeal 

tumors, primary and secondary lymphomas, intracranial tumors, and medulloblastoma), 
and to treat injury or trauma to the brain. 

TANGO 253 nucleic acids, proteins, and modulators thereof can also be utilized to 
treat renal (kidney) disorders, such as glomerular diseases (e.g., acute and chronic 

25 glomerulonephritis, rapidly progressive glomerulonephritis, nephrotic syndrome, focal 
proliferative glomerulonephritis, glomerular lesions associated with systemic disease, such 
as systemic lupus erythematosus, Goodpasture's syndrome, multiple myeloma, diabetes, 
polycystic kidney disease, neoplasia, sickle cell disease, and chronic inflammatory 
diseases), tubular diseases (e.g., acute tubular necrosis and acute renal failure, polycystic 

30 renal diseasemedullary sponge kidney, medullary cystic disease, nephrogenic diabetes, and 
renal tubular acidosis), tubulointerstitial diseases (e.g., pyelonephritis, drug and toxin 
induced tubulointerstitial nephritis, hypercalcemic nephropathy, and hypokalemic 
nephropathy), acute and rapidly progressive renal failure, chronic renal failure, 
nephrolithiasis, gout, vascular diseases (e.g., hypertension and nephrosclerosis, 

35 microangiopathic hemolytic anemia, atheroembolic renal disease, diffuse cortical necrosis, 
and renal infarcts), or tumors (e.g., renal cell carcinoma and nephroblastoma). 
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TANGO 253 nucleic acids, proteins and modulators thereof can, in addition to the 
above, be utilized to regulate or modulate development and/or differentiation of processes 
involved in microglial, lung, liver, kidney, pancreas, brain, placental and skeletal muscle 
formation and activity, as well as in ameliorating any symptom associated with a disorder 
5 of such cell types, tissues and organs. 

TANGO 253 expression can be utilized as a marker (e.g., an in situ marker) for 
specific tissues (e.g., the brain) and/or cells (e.g., neurons) in which TANGO 253 is 
expressed. TANGO 253 nucleic acids can also be utilized for chromosomal mapping. 

10 Human TANGO 257 

A cDNA encoding human TANGO 257 was identified by analyzing the sequences 
of clones present in a coronary smooth muscle library for sequences that encode secreted 
proteins. This analysis led to the identification of a clone, Athma7cl0, encoding full- 
length human TANGO 257. The human TANGO 257 cDNA of this clone is 1832 

15 nucleotides long (Figures 9A-9B; SEQ ID NO:15). The open reading frame of this cDNA, 
nucleotides 88 to 1305 of SEQ ID NO: 15 (SEQ ID NO: 16), encodes a 406 amino acid 
secreted protein (Figures 9A-9B; SEQ ID NO: 1 7). 

Figure 10 depicts a hydropathy plot of human TANGO 257. Relatively 
hydrophobic regions of the protein are above the horizontal line, and relatively hydrophilic 

20 regions of the protein are below the horizontal line. The cysteine residues (cys) and N- 
glycosylation sites are (Ngly) are indicated by short vertical lines just below the 
hydropathy trace. The dashed vertical line separates the signal sequence from the mature 
protein described below. 

The signal peptide prediction program SIGNALP (Nielsen et al., 1997, Protein 

25 Engineering 10: 1-6) predicted that human TANGO 257 includes a 21 amino acid signal 
peptide (amino acid 1 to amino acid 21 of SEQ ID NO: 17; SEQ ID NO: 19) preceding the 
mature human TANGO 257 protein (corresponding to amino acid 22 to amino acid 406 of 
SEQ ID NO: 17; SEQ ID NO: 18). The molecular weight of human TANGO 257 protein 
without post-translational modifications is 46.0 kDa prior to the cleavage of the signal 

30 peptide, 43.8 kDa after cleavage of the signal peptide. 

Two N-glycosylation sites are present in human TANGO 257. The first has the 
sequence NDTA and is found at amino acids 177 to 180 of SEQ ID NO: 17, and the second 
has the sequence NRTV and is found at amino acids 248 to 251 of SEQ ID NO: 17. A 
cAMP and cGMP dependent protein kinase phosphorylation site having the sequence 

35 RKAS is found in human TANGO 257 at amino acids 196 to 199 of SEQ ID NO: 17. Five 
protein kinase C phosphorylation sites are present in human TANGO 257. The first has 
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the sequence SSR (at amino acids 48 to 50 of SEQ ID NO: 17), the second has the 
sequence SGR (at amino acids 84 to 86 of SEQ ID NO: 17), the third has the sequence 
SMK (at amino acids 144 to 146 of SEQ ID NO:17), the fourth has the sequence TEK (at 
amino acids 166 to 168 of SEQ ID NO:17) and the fifth has the sequence SLR (at amino 

5 acids 374 to 376 of SEQ ID NO: 1 7). Five casein kinase II phosphorylation sites are 

present in human TANGO 257. The first has the sequence TEAD (at amino acids 78 to 81 
of SEQ ID NO: 17), the second has the sequence TQND (at amino acids 175 to 178 of 
SEQ ID NO: 17), the third has the sequence TWD (at amino acids 250 to 253 of SEQ ID 
NO: 17), the fourth has the sequence TYID (at amino acids 272 to 275 of SEQ ED NO: 17), 

10 and the fifth has the sequence TRED (at amino acids 289 to 292 of SEQ ID NO: 1 7). 
Human TANGO 257 has a tyrosine kinase phosphorylation site having the sequence 
RLEREVDY at amino acids 89 to 96 of SEQ ID NO: 17). Human TANGO 257 has three 
N-myristylation sites. The first has the sequence GGPGTK (at amino acids 1 15 to 120 of 
SEQ ID NO:17), the second has the sequence GGPAGL (at amino acids 152 to 157 of 

1 5 SEQ ID NO: 1 7) and the third has the sequence GAH ASL (at amino acids 3 70 to 375 of 
SEQ ID NO: 17). Human TANGO 257 has an amidation site having the sequence KGRR 
at amino acids 122 to 125 of SEQ ID NO:17. 

Northern analysis of human TANGO 257 expression demonstrates moderate 
expression in heart, liver and pancreas, and low expression in kidney, lung and skeletal 

20 muscle. 

Secretion assays reveal a human TANGO 257 protein of approximately 50kDa. 
The secretion assays were performed as described in the human TANGO 253 section 
above. 

The human TANGO 257 nucleotide sequence was mapped to human chromosome 
25 1 using the Genebridge 4 Human Radiation hybrid mapping panel with GGATGATGG 
CTACC AG ATTGTC (SEQ ID NO: 195) as the forward primer and 
GGAACATTGAGGGTTTTGACTC (SEQ ID NO: 196) as the reverse primer. 

TANGO 257 is homologous to a protein encoded by a nucleic acid sequence 
referred to in PCT Publication WO 98/39446 as "gene 64". Figure 14 shows an alignment 
30 of the human TANGO 257 amino acid sequence (SEQ ID NO: 17) with the gene 64 
encoded amino acid sequence (SEQ ID NO:43). As shown in the figure, the 353 amino 
acid gene 64 polypeptide is identical to amino acid residues 1-353 of human TANGO 257 
(SEQ ID NO: 17). Human TANGO 257 contains 406 amino acids, i.e., contains an 
additional 53 amino acid residues carboxy to residue 353. The overall amino acid 
35 sequence identity between full-length human TANGO 257 polypeptide and the gene 64- 
encoded polypeptide is approximately 87%. 
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Figures 15A-15D show an alignment of the nucleotide sequence of gene 64 (SEQ 
ID NO:66; PCT Publication WO 98/39446) and the nucleotide sequence of human 
TANGO 257 (SEQ ID NO: 1 5). The nucleotide sequences of gene 64 and human TANGO 
257 are 93.5% identical. Among the differences between the sequences is a cytosine 

5 nucleotide at human TANGO 257 (SEQ ID NO: 1 5) position 1 587 that represents an 
insertion relative to the corresponding gene 64 position when the gene 64 and TANGO 
. 257 sequences are aligned. This additional cytosine results in the TANGO 257 open 
reading frame being 1218 base pairs encoding a polypeptide of 406 amino acid residues. 
In contrast, the gene 64 nucleic acid sequence encodes a polypeptide of only 353 amino 

10 acid residues, as discussed above. 

Clone EpT257, which encodes human TANGO 257, was deposited with the 
American Type Culture Collection (10801 University Boulevard, Manassas, VA 201 10- 
2209) on April 21, 1999 and assigned Accession Number 207222. This deposit will be 
maintained under the terms of the Budapest Treaty on the International Recognition of the 

1 5 Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made 
merely as a convenience for those of skill in the art and is not an admission that a deposit 
is required under 35 U.S.C. §112. 

Mouse TANGO 257 

20 A cDNA encoding mouse TANGO 257 was identified by analyzing the sequences 

of clones present in a mouse microglia library using a rat TANGO 257 probe. This 
analysis led to the identification of a clone, Atmual02gbl, encoding full-length mouse 
TANGO 257, The mouse TANGO 257 cDNA of this clone is 1721 nucleotides long 
(Figures 1 1 A-l IB; SEQ ID NO:21). The open reading frame of this cDNA, nucleotides 

25 31 to 1248 of SEQ ID NO:21 (SEQ ID NO:22), encodes a 406 amino acid secreted protein 
(Figures 1 1 A-l IB; SEQ ID NO:23). 

Figure 12 depicts a hydropathy plot of mouse TANGO 257. Relatively 
hydrophobic regions of the protein are above the horizontal line, relatively hydrophilic 
regions of the protein are below the horizontal line. The cysteine residues (cys) and N- 

30 glycosylation sites are (Ngly) are indicated by short vertical lines just below the 

hydropathy trace. The dashed vertical line separates the signal sequence from the mature 
protein described below. 

The signal peptide prediction program SIGNALP (Nielsen et al., 1 997, Protein 
Engineering 10:1-6) predicted that mouse TANGO 257 includes a 21 amino acid signal 

35 peptide (amino acid 1 to amino acid 21 of SEQ ID NO:23; SEQ ID NO:25) preceding the 
mature TANGO 257 protein (corresponding to amino acid 22 to amino acid 406 of SEQ 
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ID NO:23; (SEQ ID NO:24). The molecular weight of mouse TANGO 257 protein 
without post-translational modifications is 45.8 kDa prior to the cleavage of the signal 
peptide, 43.6 kDa after cleavage of the signal peptide. 

Two N-glycosylation sites are present in mouse TANGO 257. The first has the 
5 sequence NDTA and is found at amino acids 1 77 to 1 80 of SEQ ID NO:23, and the second 
has the sequence NRTV and is found at amino acids 248 to 251 of SEQ ID NO:23. A 
cAMP and cGMP-dependent protein kinase phosphorylation site having the sequence 
RKAS is found in mouse TANGO 257 at amino acids 196 to 199 of SEQ ID NO:23. Five 
protein kinase C phosphorylation sites are present in mouse TANGO 257. The first has 
1 0 the sequence SSR (at amino acids 48 to 50 of SEQ ID NO:23), the second has the 
sequence TLR (at amino acids 75 to 77 of SEQ ID NO:23), the third has the sequence 
SGR (at amino acids 84 to 86 of SEQ ID NO:23), the fourth has the sequence SMK (at 
amino acids 144 to 146 of SEQ ID NO:23) and the fifth has the sequence SLR (at amino 
acids 374 to 376 of SEQ ID NO:23). Five casein kinase II phosphorylation sites are 

1 5 present in mouse TANGO 257. The first has the sequence TEAD (at amino acids 78 to 8 1 
of SEQ ID NO:23), the second has the sequence TQND (at amino acids 175 to 178 of 
SEQ ID NO:23), the third has the sequence TVVD (at amino acids 250 to 253 of SEQ ID 
NO:23), the fourth has the sequence TYBD (at amino acids 272 to 275 of SEQ ID NO:23), 
and the fifth has the sequence TRRD (at amino acids 289 to 292 of SEQ ID NO:23). 

20 Mouse TANGO 257 has a tyrosine kinase phosphorylation site having the sequence 

RLEREVDY at amino acids 89 to 96 of SEQ ID NO:23. Mouse TANGO 257 has four N- 
myristylation sites. The first has the sequence GGPGAK (at amino acids 1 15 to 120 of 
SEQ ID NO:23), the second has the sequence GGSVGL (at amino acids 151 to 157 of 
SEQ ID NO:23), the third has the sequence GGPGGG (at amino acids 227 to 232 of SEQ 

25 ID NO:23), and the fourth has the sequence GAHASL (at amino acids 370 to 375 of SEQ 
ID NO:23). Mouse TANGO 257 has an amidation site having the sequence KGRR at 
amino acids 122 to 125 of SEQ ID NO:23. 

As shown in Figure 13, human TANGO 257 protein and mouse TANGO 257 
protein are 94.1% identical. 

30 Figure 1 6 shows an alignment of mouse TANGO 257 amino acid sequence (SEQ 

ED NO:23) with the amino acid sequence encoded by gene 64 (SEQ ID NO:43). As 
shown in the figure, the 253 amino acid gene 64 polypeptide and the 406 amino acid 
mouse TANGO 257 polypeptide and the 406 amino acid mouse TANGO 257 polypeptide 
are approximately 82% identical. Figures 17A-17C show an alignment of the nucleotide 

35 sequence of gene 64 (SEQ ID NO:66; PCT publication no. 98/39446) and the nucleotide 
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sequence of mouse TANGO 257 (SEQ ID NO:21). As shown in the figure, the two 

nucleotide sequences are approximately 76% identical. 

In situ tissue screening was performed on mouse adult tissues and embryonic 

tissues (obtained from embryos E13.5 to P1.5) to analyze for the expression of mouse 
5 TANGO 257 mRNA. Mouse TANGO 257 expression was detected the following adult 

tissues: the submandibular gland; the renal papilla region of the kidney; the capsule region 

of the adrenal gland; and the labyrinth zone of the placenta. 

In the case of embryonic expression, mouse TANGO 257 expression was detected 

in the bones, lungs, intestines, and kidneys. At E13.5, a signal was detected in many 
1 0 tissues including the developing bone structures such as the vertebrae, of the spinal 

column, jaw, and scapula. At E14.5, the signal pattern was very similar to that detected at 

E13.5. At 15.5, a signal was detected in all major bone structures, including the skull, 

basisphenoid bone, upper and lower incisor teeth, vertebral column, sternum, scapula, and 

femur. A ubiquitous signal was also detected in the lung, kidney, and intestinal tract. At 
15 16.5 and 18.5, the signal is very similar to that detected at E15.5. At PI .5, a signal was 

still detected in all of the major bone structures and signal detected in the lung, kidney, 

and intestines has dropped to nearly background levels. 

Clone EpTm257, which encodes mouse TANGO 257, was deposited with the 

American Type Culture Collection (10801 University Boulevard, Manassas, VA 201 10- 
20 2209) on April 21, 1999 and assigned Accession Number 2071 17. This deposit will be 

maintained under the terms of the Budapest Treaty on the International Recognition of the 

Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made 

merely as a convenience for those of skill in the art and is not an admission that a deposit 

is required under 35 U.S.C. §112. 

25 

Uses of TANGO 257 Nucleic acids. Polypeptides, and Modulators Thereof 

As TANGO 257 was originally found in a coronary artery smooth muscle library, 
TANGO 257 nucleic acids, proteins, and modulators thereof can be used to modulate the 
proliferation, development, differentiation, and/or function of organs, e.g., heart, tissues 

30 and cells that form blood vessels and coronary tissue, e.g., cells of the coronary 

connective tissue, e.g., coronary smooth muscle cells and/or endothelial cells of blood 
vessels. TANGO 257 nucleic acids, proteins, and modulators thereof can also be used to 
modulate symptoms associated with abnormal coronary function, e.g., heart diseases and 
disorders such as atherosclerosis, coronary artery disease and plaque formation. 

35 In light of TANGO 257's homology to the extracellular molecule olfactomedin, 

TANGO 257 nucleic acids, proteins and modulators thereof can be utilized to modulate 
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development, differentiation, proliferation and/or activity of neuronal cells, e.g., olfactory 
neurons and to modulate neuronal activities involving maintenance, growth and/or 
differentiation of chemosensory cilia, modulate cell-cell interactions and cell-ECM 
interactions, e.g., neuronal (such as olfactory) cell-ECM interactions. TANGO 257 

5 nucleic acids, proteins and modulations thereof can also be used to modulate symptoms 
associated with abnormal processes involving such cells and/or activities, for example 
neuronal function, e.g., neurological disorders, neurodegenerative disorders, 
neuromuscular disorders, cognitive disorders, personality disorders, and motor disorders, 
and chemosensory disorders, such as olfactory-related disorders. 

10 TANGO 257 exhibits homology to a gene referred to as "gene 64" (PCT 

Publication No. WO 98/39446), which is expressed primarily in fetal lung tissue. In light 
of this, TANGO 257 nucleic acids, proteins and modulators thereof can also be used to 
modulate development, differentiation, proliferation and/or activity of pulmonary system 
cells, e.g., lung cell types, and to modulate a symptom associated with disorders of 

15 pulmonary development, differentiation and/or activity, cystic fibrosis. TANGO 257 
nucleic acids, proteins and modulators thereof can also be used to modulate symptoms 
associated with abnormal pulmonary development or function, such as lung diseases or 
disorders associated with abnormal pulmonary development or function, e.g., cystic 
fibrosis. TANGO 257 nucleic acids, polypeptides, or modulators thereof can be used to 

20 treat pulmonary (lung) disorders, such as atelectasis, cystic fibrosis, rheumatoid lung 
disease, pulmonary congestion or edema, chronic obstructive airway disease (e.g., 
emphysema, chronic bronchitis, bronchial asthma, and bronchiectasis), diffuse interstitial 
diseases (e.g., sarcoidosis, pneumoconiosis, hypersensitivity pneumonitis, bronchiolitis, 
Goodpasture's syndrome, idiopathic pulmonary fibrosis, idiopathic pulmonary 

25 hemosiderosis, pulmonary alveolar proteinosis, desquamative interstitial pneumonitis, 
chronic interstitial pneumonia, fibrosing alveolitis, hamman-rich syndrome, pulmonary 
eosinophilia, diffuse interstitial fibrosis, Wegener's granulomatosis, lymphomatoid 
granulomatosis, and lipid pneumonia), or tumors (e.g., bronchogenic carcinoma, 
bronchiolovlveolar carcinoma, bronchial carcinoid, hamartoma, and mesenchymal 

30 tumors). 

TANGO 257 nucleic acids, proteins and modulators thereof can also be used to 
modulate cell proliferation, e.g., abnormal cell proliferation. Such modulation may, for 
example, be via modulation of one or more elements involved in signal transduction 
cascades. 

35 TANGO 257 nucleic acids, proteins and modulators thereof can also be utilized to 

modulate the development, differentiation, maturation, proliferation and/or activity of 
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bone cells such as osteocytes, and to treat bone associated diseases or disorders. Examples 
of bone diseases and disorders include bone injury due to for example, trauma (e.g., bone 
breakage, cartilage tearing), degeneration (e.g., osteoporosis), degeneration of joints, e.g., 
arthritis, e.g., osteoarthritis, and bone wearing. Further, TANGO 257 nucleic acids, 

5 proteins and modulators thereof can be utilized to modulate or regulate the development of 
bone structures such as the skull, the basisphenoid bone, the upper and lower incisor teeth, 
the vertebral column, the sternum, the scapula, and the femur during embryogenesis. 

TANGO 257 nucleic acids, proteins and modulators thereof can, in addition to the 
above, be utilized to regulate or modulate development and/or differentiation of processes 

10 involved in microglial, liver, kidney, and skeletal muscle formation and activity, as well 
as in ameliorating a symptom associated with a disorder of such cell types, tissues and 
organs. 

TANGO 257 nucleic acids, polypeptides, or modulators thereof can also be used to 
treat renal (kidney) disorders, such as glomerular diseases (e.g., acute and chronic 

1 5 glomerulonephritis, rapidly progressive glomerulonephritis, nephrotic syndrome, focal 
proliferative glomerulonephritis, glomerular lesions associated with systemic disease, such 
as systemic lupus erythematosus, Goodpasture's syndrome, multiple myeloma, diabetes, 
polycystic kidney disease, neoplasia, sickle cell disease, and chronic inflammatory 
diseases), tubular diseases (e.g., acute tubular necrosis and acute renal failure, polycystic 

20 renal diseasemedullary sponge kidney, medullary cystic disease, nephrogenic diabetes, and 
renal tubular acidosis), tubulointerstitial diseases (e.g., pyelonephritis, drug and toxin 
induced tubulointerstitial nephritis, hypercalcemic nephropathy, and hypokalemic 
nephropathy), acute and rapidly progressive renal failure, chronic renal failure, 
nephrolithiasis, gout, vascular diseases (e.g., hypertension and nephrosclerosis, 

25 microangiopathic hemolytic anemia, atheroembolic renal disease, diffuse cortical necrosis, 
and renal infarcts), or tumors (e.g., renal cell carcinoma and nephroblastoma). 
TANGO 257 polypeptides, nucleic acids, or modulators thereof can be used to treat 
intestinal disorders, such as ischemic bowel disease, infective enterocolitis, Crohn's 
disease, benign tumors, malignant tumors (e.g., argentafHnomas, lymphomas, 

30 adenocarcinomas, and sarcomas), malabsorption syndromes (e.g., celiac disease, tropical 
sprue, Whipple's disease, and abetalipoproteinemia), obstructive lesions, hernias, 
intestinal adhesions, intussusception, or volvulus. 

Further, TANGO 257 expression can be utilized as a marker (e.g. an in situ 
marker) for specific tissues (i.e., bone structures) and/or cells (i.e., osteocytes) in which 

35 TANGO 257 is expressed. TANGO 257 nucleic acids can also be used for chromosomal 
mapping. 
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Human INTERCEPT 258 

A cDNA encoding human INTERCEPT 258 was identified by analyzing the 
sequences of clones present in a human mixed lymphocyte reaction library for sequences 
that encode secreted proteins. This analysis led to the identification of a clone, Athlxtce, 
5 encoding full-length human INTERCEPT 258. The human INTERCEPT 258 cDNA of 
this clone is 1869 nucleotides long (Figures 1 8A-1 8B; SEQ ID NO:26). The open reading 
frame of this cDNA, nucleotides 153 to 1262 of SEQ ID NO:26 (SEQ ID NO:27), encodes 
a 370 amino acid transmembrane protein (Figures 18A-18B; SEQ ED NO:28). 

Figure 19 depicts a hydropathy plot of human INTERCEPT 258. Relatively 
10 hydrophobic regions of the protein are shown above the horizontal line, and relatively 
hydrophilic regions of the protein are below the horizontal line. The cysteine residues 
(cys) are indicated by short vertical lines just below the hydropathy trace. The dashed 
vertical line separates the signal sequence (amino acids 1 to 29 of SEQ ID NO:28; SEQ ID 
NO:30) on the left from the mature protein (amino acids 30 to 370 of SEQ ID NO:28; 
1 5 SEQ ID NO:29) on the right. 

The signal peptide prediction program SIGNALP (Nielsen et al., 1997, Protein 
Engineering 10:1-6) predicted that human INTERCEPT 258 includes a 29 amino acid 
signal peptide (amino acid 1 to amino acid 29 of SEQ ID NO:26; SEQ ID NO:30) 
preceding the mature INTERCEPT 258 protein (corresponding to amino acid 30 to amino 
20 acid 370 of SEQ ID NO:26; SEQ ID NO:29). The molecular weight of human 

INTERCEPT 258 protein without post-translational modifications is 40.0 kDa prior to the 
cleavage of the signal peptide, 37.0 kDa after cleavage of the signal peptide. 

Human INTERCEPT 258 contains a hydrophobic transmembrane domain at amino 
acids amino acids 207 to 224 of SEQ ID NO:28 (SEQ ID NO:78) and amino acids 247 to 
25 27 1 of SEQ ID NO:28 (SEQ ID NO:33). Human INTERCEPT 258 also contains two Ig 
domains, one at amino acids 49 to 128 of SEQ ID NO:28 (SEQ ID NO:35) and a second at 
amino acids 167 to 226 of SEQ ID NO:28 (SEQ ID NO:36). 

Five N-glycosylation sites are present in human INTERCEPT 258. The first has 
sequence NLSL and is found at amino acids 108 to 1 1 1 of SEQ ID NO:28, the second has 
30 the sequence NUTL and is found at amino acids 1 69 to 1 72 of SEQ ED NO:28; the third is 
has the sequence NLSS and is found at amino acids 213 to 216 of SEQ ID NO:28, the 
fourth has the sequence NUTL and is found at amino acids, 236 to 239 of SEQ ID NO:28, 
and the fifth has die sequence NGTL and is found at amino acids 307 to 3 1 0 of SEQ ID 
NO:28. Seven protein kinase C phosphorylation sites are present in human DMTERCEPT 
35 258. The first has the sequence TSK and is found at amino acids 93 to 95 of SEQ ID 
NO:28, the second has the sequence SLR and is found at amino acids 1 10 to 1 12 of SEQ 
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ID NO:28, the third has the sequences SIK and is found at amino acids 141 to 143 or SEQ 
ID NO:28, the fourth has the sequence SCR and is found at amino acids 157 to 159, the 
fifth has the sequence SPR and is found at amino acids 176 to 179 of SEQ ID NO:28, the 
sixth has the sequence SAR and is found at amino acids 315 to 317 of SEQ ID NO:28, and 

5 the seventh has the sequence SPR and is found at amino acids 344 to 346 of SEQ ID 

NO:28. The human INTERCEPT 258 protein has seven N-myristoylation sites. The first 
has the sequence GUTTSK and is found at amino acids 90 to 95 of SEQ ID NO:28, the 
second has the sequence GANVTL and is found at amino acids 167 to 172 of SEQ ID 
NO:28, the third has the sequence GVYVCK and is found at amino acids 220 to 225, the 

10 fourth has the sequence GTAQCN and is found at amino acids 231 to 236 of SEQ ID 
NO:28, the fifth has the sequence GTLVGL and is found at amino acids 256 to 261, the 
sixth has the sequence GLLAGL and is found at amino acids 262 to 267 of SEQ ID 
NO:28, and the seventh has the sequence GTLSSU and is found at acids 308 to 313 of 
SEQIDNO:28. 

1 5 The human INTERCEPT 258 gene was mapped to human chromosome 1 1 using 

Genebridge 4 Human Radiation hybrid mapping panel with 
GGAGTATCCTTGGTCTACTCC (SEQ ID NO: 197) as the forward primer and 
GAAAGTCTGGAAGGATGGAAGCT (SEQ ID NO: 198) as the reverse primer. 

Human multi-tissue dot blot analysis of human INTERCEPT 258 expression 

20 demonstrates strongest expression in lung, fetal lung, placenta, thyroid gland and 

mammary gland. Moderate expression is observed in heart, aorta, kidney, small intestine, 
fetal heart, fetal kidney, fetal spleen, uterus, and stomach. Weak expression is observed in 
whole brain, amygdala, caudate nucleus, cerebellum, cerebral cortex frontal lobe, 
hippocampus, medulla oblongata, occipital lobe, putamen, substantia nigra, temporal lobe, 

25 thalamus, acumens, spinal cord, skeletal muscle, colon, bladder, prostate, ovary, pancreas, 
pituitary gland, adrenal gland, salivary gland, liver, spleen, thymus, lymph node, bone 
marrow, appendix, trachea, fetal brain, fetal liver, and fetal thymus. 

A human cancer cell line Northern blot analysis showed a roughly 2.0 kb 
INTERCEPT 258 band only in the lane containing cell line Chronic Myelogenous 

30 Leukemia (K-562). The cancerous cell lines in which INTERCEPT 258 was not 
expressed include promyeocytic leukemia, Hela, lymphoblastic leukemia, Burkitt's 
lymphoma Raji, colorectal adenocarcinoma, lung carcinoma and melanoma. 

INTERCEPT 258 exhibits homology to a human A33 antigen. A33 antigen is a 
transmembrane glycoprotein and a member of the immunoglobulin superfamily that may 

35 represent a cancer cell marker (Heath et al., 1997, Proc. Natl. Acad. Sci. USA 94:469- 

474). Figure 23 shows an alignment of the human INTERCEPT 258 amino acid sequence 
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(SEQ ID NO:28) with the human A33 amino acid sequence (SEQ ID NO:67). The 
alignment shows that there is a 23.0% overall amino acid sequence identity between 
human INTERCEPT 258 and A33. Figures 24A-24D show an alignment of the human 
INTERCEPT 258 nucleotide sequence (SEQ ID NO:26) with that of human A33 

5 nucleotide sequence (SEQ ID NO:68). The alignment shows that there is a 40.6% identity 
between the two sequences. 

Human INTERCEPT 258 nucleotide sequence (SEQ ID NO:26) exhibits 
homology to human PECAM-1 nucleotide sequence (SEQ ED NO:72). Figures 27A-27E 
show that there is an overall 40.5% identity between the two nucleotide sequences. 

1 0 Human INTERCEPT 258 amino acid sequence (SEQ ID NO:28) and human PECAM- 1 
amino acid sequence (SEQ ID NO:73) share less than 18% identity. PECAM-1 (platelet 
endothelial cell adhesion molecule- 1) is an integrin expressed on endothelial cells. 

Clone EpT258, which encodes human INTERCEPT 258, was deposited with the 
American Type Culture Collection (10801 University Boulevard, Manassas, VA 201 10- 

1 5 2209) on April 2 1 , 1 999 and assigned Accession Number 207222. This deposit will be 
maintained under the terms of the Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made 
merely as a convenience for those of skill in the art and is not an admission that a deposit 
is required under 35 U.S.C. §112. 

20 

Mouse INTERCEPT 258 

A cDNA encoding mouse INTERCEPT 258 was identified by analyzing the 
sequences of clones present in a mouse megakaryocyte library for sequences that encode 
secreted proteins. This analysis led to the identification of a clone, Athmeal7c8, encoding 

25 full-length mouse INTERCEPT 258. The mouse INTERCEPT 258 cDNA of this clone is 
1 846 nucleotides long (Figures 20A-20B; SEQ ID NO:37). The open reading frame of 
this cDNA, nucleotides 107 to 1288 of SEQ ID NO:37 (SEQ ID NO:38), encodes a 394 
amino acid transmembrane protein (Figures 20A-20B, SEQ ID NO:39). 

Figure 21 depicts a hydropathy plot for mouse INTERCEPT 258. Relatively 

30 hydrophobic regions of the protein are above the horizontal line, relatively hydrophilic 
regions of the protein are below the horizontal line. The cysteine residues (cys) and N- 
glycosylation sites are (Ngly) are indicated by short vertical lines just below the 
hydropathy trace. The dashed vertical line separates the signal sequence from the mature 
protein described below. 

35 The signal peptide prediction program SIGNALP (Nielsen et al., 1997, Protein 

Engineering 10:1-6) predicted that mouse INTERCEPT 258 includes a 29 amino acid 
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signal peptide (amino acid 1 to amino acid 29 of SEQ ID NO:39; SEQ ID NO:41) 
preceding the mature INTERCEPT 258 protein (corresponding to amino acid 30 to amino 
acid 394 of SEQ ID NO:39; SEQ ID NO:40). The molecular weight INTERCEPT 258 
without post-translational modifications is 41.8 kDa prior to the cleavage of the signal 

5 peptide, 38.90 kDa after cleavage of the signal peptide. 

Mouse INTERCEPT 258 contains a hydrophobic transmembrane domain at amino 
acids 250 to 274 SEQ ED NO:39 (SEQ ID NO:44). Mouse INTERCEPT 258 also contains 
an Ig domain at amino acids 170 to 229 of SEQ ED NO:39 (SEQ ID NO:71). 

Five N-glycosylation sites are present in mouse INTERCEPT 258. The first has 

10 sequence NVSL and is found at amino acids 1 1 1 to 1 14 of SEQ ID NO:39, the second has 
the sequence NVTL and is found at amino acids 172 to 175 of SEQ ID NO:39, the third 
has the sequence NLSI and is found at amino acids 216 to 219 of SEQ ID NO:39, the 
fourth has the sequence NVTL and is found at amino acids, 239 to 242 of SEQ ID NO:39, 
and the fifth has the sequence NGTL and is found at amino acids 310 to 313 of SEQ ID 

15 NO:39. Nine protein kinase C phosphorylation sites are present in mouse INTERCEPT 
258. the first has the sequence TNK and is found at amino acids 96 to 98 of SEQ ID 
NO:39, the second has the sequence SSR and is found at amino acids 108 to 1 10 of SEQ 
ID NO:39, the third has the sequence SLR and is found at amino acids 1 13 to 1 15 of SEQ 
ID NO:39, the fourth has the sequence TYR and is found at amino acids 126 to 128, the 

20 fifth has the sequence SIK and is found at amino acids 144 to 146 of SEQ ID NO:39, the 
sixth has the sequence SPR and is found at amino acids 179 to 181 of SEQ ID NO:39, the 
seventh has the sequence SLK and is found at amino acids 211 and 213, the eighth has the 
sequence SAR and is found at amino acids 318 to 320 of SEQ ID NO:39, and the ninth 
has the sequence SPR and is found at amino acids 348 to 350 of SEQ ED NO:39. The 

25 mouse INTERCEPT 258 contains a casein kinase II phosphorylation site having the 
sequence TLEE, found at amino acids 280 to 283 of SEQ ID NO:39. The mouse 
INTERCEPT 258 protein has nine N-myristoylation sites. The first has the sequence 
GTPETS and is found at amino acids 6 to 1 1 of SEQ ID NO:39, the second has the 
sequence GVMTNK and is found at amino acids 125 to 130 of SEQ ID NO:39, the third 

30 has the sequence GTYRCS and is found at amino acids 125 to 130, the fourth has the 
sequence GTNVTL and is found at amino acids 170 to 175 of SEQ ID NO:39, the fifth 
has the sequence GVYVCK and is found at amino acids 223 to 228, the sixth has the 
sequence GSKAAV and is found at amino acids 247 to 252, the seventh has the sequence 
GAVVGT and is found at amino acids 255 to 260 of SEQ ID NO:39, the eighth has 

35 sequence GTLSSV and is found at amino acids 31 1 to 316 of SEQ ID NO:39, and the 
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ninth has the sequence GGVSSS and is found at amino acids 367 to 372 of SEQ ID 
NO:39. 

An in situ expression analysis of INTERCEPT 258 was performed as summarized 
herein. Mouse INTERCEPT 258 expression during embryogenesis (E73.5 to PI. 5 were 
5 examined) was observed throughout the animal in a punctate pattern. This pattern is very 
similar to that seen with the molecule PECAM-1, but at a lower intensity. PECAM-1 is an 
integrin expressed on endothelial cells. In addition, lung and brown fat exhibited a much 
higher signal in a more ubiquitous pattern in all embryonic stages examined. Heart and 
kidney also have a higher expression, but to a lesser degree. Adult mouse INTERCEPT 
10 258 expression was seen in many tissues, often in a multifocal, punctate pattern suggestive 
of vessels. Expression was also predominant in many highly vascularized tissues such as 
ovary (especially the septol region), kidney and adrenal cortex. 

In general, both embryonic and adult expression patterns were suggestive of 
endothelial cells being a component in the expression patters observed. In summary, 
1 5 tissues in which INTERCEPT 258 expression was observed were as follows: brain, eye, 
harderian gland, submanibular gland, bladder, brown fat, stomach, heart, kidney, adrenal 
gland, colon, liver, thymus, lymph node, spleen, spinal cord, ovary, testes and placenta. 

As shown in Figure 22, human INTERCEPT 258 protein and mouse INTERCEPT 
258 protein are 62.8% identical. 
20 Mouse INTERCEPT 258 exhibits homology to a human A33 antigen. Figure 25 

shows an alignment of mouse INTERCEPT 258 amino acid sequence (SEQ ID NO:39) 
with the human A33 amino acid sequence (SEQ ID NO:96). The alignment shows that 
there is a 23% overall amino acid sequence identity between the two sequences. Figures 
26A-26D show an alignment of the mouse INTERCEPT 258 nucleotide sequence (SEQ 
25 ID NO:37) with that of the human A33 nucleotide sequence (SEQ ID NO:97). The 
alignment shows that there is a 40% identity between these two nucleotide sequences. 

Clone EpTm258, which encodes mouse INTERCEPT 258, was deposited with the 
American Type Culture Collection (10801 University Boulevard, Manassas, VA 201 10- 
2209) on April 21, 1999 and assigned Accession Number 207221. This deposit will be 
30 maintained under the terms of the Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made 
merely as a convenience for those of skill in the art and is not an admission that a deposit 
is required under 35 U.S.C. § 1 12. 

35 Uses of INT ERCEPT 258 Nucleic acids. Polypeptides, and Modulators Thereof 
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INTERCEPT 258 was identified as being expressed in a mixed lymphocyte library. 
In light of this, INTERCEPT 258 nucleic acids, proteins and modulators thereof can be 
utilized to modulate processes involved in lymphocyte development, differentiation and 
activity, including, but not limited to development, differentiation and activation of T 

5 cells, including T helper, T cytotoxic and non-specific T killer cell types and subtypes, and 
B cells, immune functions associated with such cells, and amelioration of one or more 
symptoms associated with abnormal function of such cell types. Such disorders can 
include, but are not limited to, autoimmune disorders, such as organ specific autoimmune 
disorders, e.g., autoimmune thyroiditis, Type I diabetes mellitus, insulin-resistant diabetes, 

1 0 autoimmune anemia, multiple sclerosis, and/or systemic autoimmune disorders, e.g. , 
rheumatoid arthritis, lupus or sclerodoma, allergy, including allergic rhinitis and food 
allergies, asthma, psoriasis, graft rejection, transplantation rejection, graft versus host 
disease, pathogenic susceptibilities, e.g., susceptibility to certain bacterial or viral 
pathogens, wound healing and inflammatory reactions. 

1 5 INTERCEPT 258 includes one or more Ig domains. INTERCEPT 258 nucleic 

acids, proteins, and modulators thereof can, therefore, be used to modulate immune 
function, e.g., by the modulation of immunoglobulins and the formation of antibodies. 
For the same reason, INTERCEPT 258 nucleic acids, proteins, and modulators thereof can 
be used to modulate immune response, leukocyte trafficking, cancer, Type I immunologic 

20 disorders, e.g., anaphylaxis and/or rhinitis, by modulating the interaction between antigens 
and cell receptors, e.g., high affinity IgE receptors. 

INTERCEPT 258 exhibits homology to PECAM-1, a cell adhesion integrin 
molecule that has been shown to mediate cell-cell interactions, play an important role in 
bidirectional signal transduction, and may be involved in thrombotic, inflammatory and 

25 immunological disorders. As such, INTERCEPT 258 nucleic acids, proteins, and 
modulators thereof can be utilized to modulate cell/cell interactions and, for example, 
signal transduction events associated with such interactions. For example, such 
INTERCEPT 258 compositions and modulators thereof can be used to modulate binding 
of cellular factors or ECM-associated factors such as integrin and can function to modulate 

30 ligand binding to cell surface receptors. Further, such INTERCEPT 258 compositions and 
modulators thereof can be utilized to ameliorate at least one symptom associated with 
thrombotic disorders, e.g., stroke, inflammatory processes or disorders, and immune 
disorders. 

In light of INTERCEPT 258 expression, INTERCEPT 258 nucleic acids, proteins 
35 and modulators thereof can be utilized modulate development, differentiation, 

proliferation and/or activity of pulmonary system cells, e.g., lung cell types, and to 
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modulate a symptom associated with disorders of pulmonary development, differentiation 
and/or activity, such as lung diseases or disorders associated with abnormal pulmonary 
development or function, e.g., cystic fibrosis. INTERCEPT 258 nucleic acids, proteins 
and modulators thereof can also be utilized modulate development, differentiation, 

5 proliferation and/or activity of thyroid cells, megakaryocytes or mammary gland cells, 

and can further be utilized to ameliorate at least one symptom of disorders associated with, 
abnormal thyroid function, e.g., thyroiditis or Grave's disease, abnormal megakaryocyte 
differentiation or function, e.g., anemias or leukemias, hematological diseases such as 
thrombocytopenia, platelet disorders and bleeding disorders, such as hemophilia or 

1 0 abnormal mammary development or function. 

INTERCEPT 258 nucleic acids, polypeptides, or modulators thereof can be used to 
treat renal (kidney) disorders, such as glomerular diseases {e.g., acute and chronic 
glomerulonephritis, rapidly progressive glomerulonephritis, nephrotic syndrome, focal 
proliferative glomerulonephritis, glomerular lesions associated with systemic disease, such 

15 as systemic lupus erythematosus, Goodpasture's syndrome, multiple myeloma, diabetes, 
polycystic kidney disease, neoplasia, sickle cell disease, and chronic inflammatory 
diseases), tubular diseases {e.g., acute tubular necrosis and acute renal failure, polycystic 
renal diseasemedullary sponge kidney, medullary cystic disease, nephrogenic diabetes, and 
renal tubular acidosis), tubulointerstitial diseases {e.g., pyelonephritis, drug and toxin 

20 induced tubulointerstitial nephritis, hypercalcemic nephropathy, and hypokalemic 
nephropathy), acute and rapidly progressive renal failure, chronic renal failure, 
nephrolithiasis, gout, vascular diseases {e.g., hypertension and nephrosclerosis, 
microangiopathic hemolytic anemia, atheroembolic renal disease, diffuse cortical necrosis, 
and renal infarcts), or tumors {e.g., renal cell carcinoma and nephroblastoma). 

25 INTERCEPT 258 nucleic acids, polypeptides, or modulators thereof can also be used to treat 
disorders of the brain, such as cerebral edema, hydrocephalus, brain herniations, iatrogenic 
disease (due to, e.g., infection, toxins, or drugs), inflammations {e.g., bacterial and viral 
meningitis, encephalitis, and cerebral toxoplasmosis), cerebrovascular diseases {e.g., hypoxia, 
ischemia, and infarction, intracranial hemorrhage and vascular malformations, and 

30 hypertensive encephalopathy), and tumors {e.g., neuroglial tumors, neuronal tumors, tumors 
of pineal cells, meningeal tumors, primary and secondary lymphomas, intracranial tumors, 
and medulloblastoma), and to treat injury or trauma to the brain. 

INTERCEPT 258 nucleic acids, proteins, and modulators thereof can still further 
be utilized to modulate development, differentiation proliferation and/or activity of cells 

35 involved in kidney or heart formation and function. In addition, such compositions and 
modulators thereof can be utilized to ameliorate at least one symptom of disorders 
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associated with abnormal kidney or heart formation or function, including, but not limited 
to nephritis, coronary disease, atherosclerosis and plaque formation. 

INTERCEPT 258 expression indicates that INTERCEPT 258 is involved, in 
addition to the above, in such processes as thermogenesis, adipocyte function, and 

5 vascularization. As such, INTERCEPT 258 nucleic acids, proteins, and modulators 

thereof can be utilized to modulate such processes as well as for ameliorating at least one 
symptom associated with such processes. Such disorders include, but are not limited to 
obesity, regulation of body temperature, and disorders involving abnormal vascularization, 
e.g. , vascularization of solid tumors. 

10 In further light of INTERCEPT 258 expression, as well as in light of its homology 

to A33 antigen, INTERCEPT 258 nucleic acids, proteins and modulators thereof can be 
utilized to modulate cell proliferation, including, for example, epithelial, e.g., 
gastrointestinal tract epithelial cell proliferation, and to ameliorate at least one symptom of 
cell proliferative disorders such as cancer, and, in particular, chronic myelogenous 

1 5 leukemia, colon cancers, small bowel epithelium cancers and other gastrointestinal tract 
cancers. Further, INTERCEPT 258 expression can be utilized as a marker for specific 
tissues (e.g., vascularized tissues) and/or cells (e.g., endothelial cells) in which 
INTERCEPT 258 is expressed. INTERCEPT 258 nucleic acids can also be utilized for 
chromosomal mapping. 

20 

Human TANGO 281 

A cDNA encoding human TANGO 28 1 was identified by analyzing the sequences of 
clones present in a human megakarocyte cDNA library. This analysis led to the identification 
of a clone, AThPb8 Id 1 0, encoding full-length human TANGO 28 1 . The human TANGO 28 1 

25 cDNA of this clone is 1 8 1 2 nucleotides long (Figures 28A-28B; SEQ ID NO:46). The open 
reading frame of this cDNA, nucleotides 65 to 799 of SEQ ID NO:46 (SEQ ID NO:47), 
encodes a 245 amino acid transmembrane protein (Figures 28A-28B; SEQ ID NO:48). 

The signal peptide prediction program SIGNALP (Nielsen, et al. (1997) Protein 
Engineering 10:1-6) predicted that human TANGO 281 includes an 38 amino acid signal 

30 peptide (amino acid 1 to amino acid 38 of SEQ ID NO:48; SEQ ID NO:49) preceding the 
mature TANGO 281 protein (corresponding to amino acid 39 to amino acid 245 of SEQ ID 
NO:48; SEQ ED NO:50). The molecular weight of TANGO 281 without post-translational 
modifications is 26.5 kDa prior to the cleavage of the signal peptide, 20.2 kDa after cleavage 
of the signal peptide. 

35 Human TANGO 281 is a transmembrane protein which contains one or more of the 

following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3) a 
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cytoplasmic domain. The human TANGO 281 protein contains an extracellular domain at 
amino acids 1 to 123 of SEQ ID NO:48 or a mature extracellular domain at about amino acid 
residues 39 to 123 of SEQ ED NO:48 (SEQ ED NO:51), a transmembrane domain at amino 
acid residues 124 to 148 of SEQ ED NO:48 (SEQ ED NO:52), and a cytoplasmic domain at 
5 amino acid residues 149 to 245 of SEQ ID NO:48 (SEQ ID NO:53). 

Figure 29 depicts a hydropathy plot of human TANGO 281. Relatively hydrophobic 
regions of the protein are shown above the horizontal line, and relatively hydrophilic regions 
of the protein are below the horizontal line. The cysteine residues (cys) and potential 
N-glycosylation sites (Ngly) are indicated by short vertical lines just below the hydropathy 
10 trace. The dashed vertical line separates the signal sequence (amino acids 1 to 38 of SEQ ID 
NO:48; SEQ ED NO:49) on the left from the mature protein (amino acids 38 to 245 of SEQ 
ID NO:48; SEQ ED NO:50) on the right. 

Human TANGO 281 comprises photosystem II 10 kD phosphoprotein (PSBH) 
domain sequences, which have been shown to be phosphorylated in a light-dependent 

15 reaction, from amino acids 41 to 90 and 127 to 182 of SEQ ID NO:48 (SEQ ID NO:54 and 
SEQ ED NO:55, respectively). Figure 30 depicts an alignment between the PSBH domain 
(SEQ ED NO:69; Accession No. PF00737) and human TANGO 281 from amino acids 97 to 
146 of SEQ ID NO:48. An N-glycosylation site having the sequence NTTT is present in 
TANGO 281 at about amino acids 160 to 163 of SEQ ID NO:48. Two protein kinase C 

20 phosphorylation sites are present in human TANGO 28 1 . The first has the sequence S VR (at 
amino acids 8 to 10 of SEQ ID NO:48), and the second has the sequence SSR (at amino acids 
87 to 89 of SEQ ED NO:48). Three casein kinase II phosphorylation sites are present in 
human TANGO 281. The first has the sequence SIPE (at amino acids 49 to 52 of SEQ ID 
NO:48), the second has the sequence SCPD (at amino acids 53 to 56 of SEQ ID NO:48), and 

25 the third has the sequence SSLD (at amino acids 108 to 111 of SEQ ID NO:48). Human 
TANGO 281 has two N-myristylation sites. The first has the sequence GSCSSQ (at amino 
acids 60 to 65 of SEQ ID NO:48), and the second has the sequence GATVAI (at amino acids 
1 19 to 124 of SEQ ED NO:48). 

Nucleic acid base pairs 413 to 746 of human TANGO 281 (SEQ ED NO:46) have 

30 81% identity to the nucleic acid sequence identified as Accession Number AV34245 . Nucleic 
acid base pairs 438 to 746 of human TANGO 281 (SEQ ID NO:46) have 80% identity to a 
nucleic acid sequence referred to as "gene 31" described in PCT Publication No. WO 
98/39446 (SEQ ED NO:70). "Gene 31" is characterized as being expressed primarily in brain 
and thymus, and to a lesser extent in such organs as liver, skin, bone and bone marrow. 

35 Clone EpT281 was deposited with the American Type Culture Collection (10801 

University Boulevard, Manassas, V A 20 1 1 0-2209) on April 2 1 , 1 999 and assigned Accession 



-55- 



WO 00/78808 PCT/US00/16883 

Number 207222. This deposit will be maintained under the terms of the Budapest Treaty on 
the International Recognition of the Deposit of Microorganisms for the Purposes of Patent 
Procedure. This deposit was made merely as a convenience for those of skill in the art and 
is not an admission that a deposit is required under 35 U.S.C. § 1 12. 

5 

Mouse TANGO 281 

A cDNA encoding mouse TANGO 281 was identified in a normal mouse 
megakaryocyte library by performing expression profiling on megakarocytes obtained from 
mice with a the deletion of the element of the gata-1 gene responsible for megakaryocyte- 
10 specific expression. This analysis led to the identification of a clone, Atmea49d3, encoding 
full-length mouse TANGO 281. The mouse TANGO 281 cDNA of this clone is 1858 
nucleotides long (Figure 30; SEQ ID NO:56). The open reading frame of this cDNA, 
nucleotides 90 to 728 of SEQ ID NO:56 (SEQ ID NO:57), encodes a 213 amino acid 
transmembrane protein (Figure 30; SEQ ID NO:58). 
15 The signal peptide prediction program SIGNALP (Nielsen, et al. (1997) Protein 

Engineering 10:1-6) predicted that mouse TANGO 281 includes an 26 amino acid signal 
peptide (amino acid 1 to amino acid 26 of SEQ ID NO:58; SEQ ID NO:59) preceding the 
mature TANGO 281 protein (corresponding to amino acid 27 to amino acid 213 of SEQ ID 
NO:58; SEQ ID NO:60). The molecular weight of mouse TANGO 281 without post- 
20 translational modifications is 22.9 kDa prior to the cleavage of the signal peptide, 20.2 kDa 
after cleavage of the signal peptide. 

Mouse TANGO 281 is a transmembrane protein which contains one or more of the 
following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3) a 
cytoplasmic domain. The mouse TANGO 281 protein contains an extracellular domain at 
25 amino acid residues 27 to 1 12 of SEQ ID NO:58 (SEQ ID NO:61), a transmembrane domain 
at amino acid residues 1 13 to 137 of SEQ ID NO:58 (SEQ ID NO:62), and a cytoplasmic 
domain at amino acid residues 138 to 213 of SEQ ID NO:58 (SEQ ID NO:63). 

Figure 32 depicts a hydropathy plot of mouse TANGO 281 . Relatively hydrophobic 
regions of the protein are shown above the horizontal line, and relatively hydrophilic regions 
30 of the protein are below the horizontal line. The cysteine residues (cys) and potential 
N-glycosylation sites (Ngly) are indicated by short vertical lines just below the hydropathy 
trace. The dashed vertical line separates the signal sequence (amino acids 1 to 26 of SEQ ID 
NO:58; SEQ ID NO:59) on the left from the mature protein (amino acids 27 to 213 of SEQ 
ID NO:58; SEQ ID NO:60) on the right. 
3 5 Mouse TANGO 281 comprises photosystem II 1 0 kD phosphoprotein (PSBH) domain 

sequences, which have been shown to be phosphorylated in a light-dependent reaction, from 
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amino acids 42 to 91 and 128 to 183 of SEQ ID NO:58 (SEQ ID NO:64 and SEQ ID NO:65, 
respectively). Two N-glycosylation sites having the sequences NTTT (at amino acids 149 
to 152 of SEQ ID NO:58) and NASS (at about amino 189 to 192 of SEQ ID NO:58) are 
present in TANGO 281. A glycosaminoglycan attachment site having the sequence SGFG 

5 is present in mouse TANGO 281, and protein kinase C phosphorylation site having the 
sequence SSR is present in mouse TANGO 28 1 . Two casein kinase II phosphorylation sites 
are present in human TANGO 281. The first has the sequence TPAE (at amino acids 80 to 
83 of SEQ ED NO:58), and the second has the sequence SSFD (at amino acids 97 to 100 of 
SEQ ID NO:58). Mouse TANGO 281 has two N-myristylation sites. The first has the 

10 sequence GSCSNQ (at amino acids 48 to 53 of SEQ ID NO:58), and the second has the 
sequence GATVAI (at amino acids 1 08 to 1 1 3 of SEQ ID NO:58). 

Northern blot analysis of mouse TANGO 281 expression revealed two mRNA bands, 
one of approximately 1 .8 kb and another approximately 1 .4 kb. Expression of the 1 .8 kb band 
was detected in the heart, spleen, lung and kidney, with the greatest abundance detected in the 

1 5 heart and lung, followed by the kidney and trace amounts in the spleen. Expression of the 1 .4 
kb band was detected in the brain, spleen, and lung. Expression of the 1.4 kb and 1.8 kb 
species of mouse TANGO 28 1 was detected in 7 day old normal mouse embryos. Neither the 
1 .4 kb or the 1 . 8 kb species of mouse TANGO 28 1 were detected in 1 1 day old normal mouse 
embryos. The 1.8 kb species of mouse TANGO 281 was detected in 15 day old normal 

20 mouse embryos at 20 % the level detected in 7 day old normal mouse embryos. Expression 
of the 1.8 kb species detected in 17 day old normal mouse embryos was comparable to the 
level of expression detected in 7 day old normal mouse embryos. Expression of mouse 
TANGO 281 expression was greatly reduced in megakaryocytes obtained from gata-1 
knockout mice. 

25 In situ tissue screening was performed on mouse adult and embryonic tissues to 

analyze for the expression of mouse TANGO 281 mRNA. Mouse TANGO 281 expression 
was detected predominantly in the adult lymphoid tissues such as the thymus, lymph node, 
and spleen. In particular, mouse TANGO 28 1 expression was detected in the following adult 
tissues: a moderate, ubiquitous signal was detected in the submandibular gland; a strong, 

30 ubiquitous signal was detected in the adrenal gland; a strong, multifocal signal was detected 
in the medulla of the thymus and a moderate, ubiquitous signal was detected in the cortex of 
the thymus; a strong signal was detected in the lymph node; a strong signal was detected in 
the follicles of the spleen; a weak signal was detected in the mucosal epithelium of the 
bladder; a strong signal was detected in the ovaries; a ubiquitous signal was detected in the 

35 placenta; a moderate signal was detected in the muscle region Of the stomach; a weak signal 
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in a pattern outlining many of the large airways was detected in lung; a weak, ubiquitous 
signal was detected in the liver; and a weak, ubiquitous signal was detected in the kidney. 

In the case of embryonic expression, mouse TANGO 281 expression was detected in 
the lung, stomach, thymus and submaxillary gland. In particular, at El 6.5 a weak to moderate 
5 signal was detected in the intestine and stomach, and a moderate, ubiquitous signal was 
detected in the lung. At PI. 5, a signal was detected in the lung, stomach, thymus, and 
submaxillary gland. 

Figure 33 shows that there is an overall 66.5% identity between the precursor human 
TANGO 281 amino acid sequence and the precursor mouse TANGO 281 amino acid 
10 sequence. 

Clone EpT281 was deposited with the American Type Culture Collection (10801 
University Boulevard, Manassas, VA 20110-2209) on June 15, 1999 and assigned patent 
deposit Number PTA-224. This deposit will be maintained under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes 
1 5 of Patent Procedure. This deposit was made merely as a convenience for those of skill in the 
art and is not an admission that a deposit is required under 35 U.S.C. §112. 

Uses of TANGO 281 Nucleic acids. Polypeptides, and Modulators Thereof 

As TANGO 281 was originally found in a megakaryocyte library, TANGO 281 

20 nucleic acids, proteins, and modulators thereof can be used to modulate the proliferation, 
differentiation, and/or function of megakaryocytes and platelets. TANGO 28 1 nucleic acids, 
proteins, and modulators thereof can be used to treat associated hematological diseases such 
as thrombocytopenia, platelet disorders and bleeding disorders (e.g., hemophilia). TANGO 
281 nucleic acids, proteins, and modulators thereof can be used to modulate platelet 

25 aggregation and degranulation. Further, as TANGO 281 expression varies in mouse embryos 
during development, TANGO 28 1 nucleic acids, proteins, and modulators thereof can be used 
to modulate the development of cells, tissues or organs in embryos. 

As TANGO 28 1 expression is greatly reduced in megakaryocytes obtained from gata- 
1 knockout mice compared normal mice, TANGO 281 is either a direct or indirect target of 

30 gata- 1 and has profound biological implications. Gata-1 is a transcription factor involved in 
the development of hemapoietic cell lineages — gata-1 expression is required for proper 
development of erythocytes and megakaryocytes. Although deletion of the gata-1 gene is 
lethal at the embryonic stage due to a failure to form red blood cells, deletion of only the 
element of the gata-1 gene responsible for megakaryocyte-specific expression (a 1 0 kb region 

35 of genomic DNA containing a megakaryocyte specific DNase I hypersensitive) is not lethal 
and results in a reduction in gata-1 expression in the megakaryocyte without affecting gata-1 



-58- 



WO 00/78808 



PCT/US00/16883 



expression in red blood cells. The megakaryocytes of mice with this element of the gata-1 
gene knocked out fail to develop into mature platelets, and the mice experience abnormal 
bleeding due to their profound thrombocytopenia. TANGO 281 nucleic acids, proteins, and 
modulators thereof can be used to treat disease and/or disorders associated with gata-1 
5 dysfunction. In light of the reduced expression of TANGO 281 in gata-1 knockout mice, 
TANGO 281 expression can be utilized as a marker for modulators of gata-1 expression 
and/or activity. 

As TANGO 281 is expressed in the heart, brain, spleen, lung, kidney, embryo and 
megakaryocytes, TANGO 281 nucleic acids, proteins, and modulators thereof can be used to 

1 0 treat disorders of these cells, tissues, or organs, e.g. , ischemic heart disease or atherosclerosis, 
head trauma, brain cancer, splenic lymphoma, splenomegaly, lung cancer, cystic fibrosis, 
rheumatoid lung disease, glomerulonephritis, end stage renal disease, uremia, DiGeorge 
syndrome, thymoma, autoimmune disorders, atresia, Crohns's disease, and various embryonic 
disorders. TANGO 281 nucleic acids, proteins, and modulators thereof can be used to 

1 5 modulate the bleeding associated with uremia. Further, TANGO 28 1 nucleic acids, proteins, 
and modulators thereof can be used to treat hypercoagulation associated with a damaged 
endothelium, e.g., pre-eclampsia, malignant hypertension, disseminated intravascular 
coagulopathy, renal transplant rejection, cyclosporin toxicity, microangiopathic hemolytic 
anemia, and thrombotic thrombocytopenic purpura. 

20 TANGO 28 1 exhibits homology to a gene referred to as "gene 3 1 " (PCT Publication 

No. W098/39446), which is expressed primarily in the brain and thymus. In light of this, 
TANGO 281 nucleic acids, proteins and modulators thereof can be utilized to ameliorate at 
least one symptom associated with central nervous (CNS) disorders, hematopoietic disorder, 
and disorders of the endocrine system. 

25 Further, in light of TANGO 281*s pattern of expression in mice, TANGO 281 

expression can be utilized as a marker for specific tissues (e.g., lymphoid tissues such as the 
thymus and spleen) and/or cells (e.g., lymphocytes) in which INTERCEPT 281 is expressed. 
TANGO 28 1 nucleic acids can also be utilized for chromosomal mapping. 

Tables 1-4 below provide a summary of the sequence information for TANGO 253, 

30 TANGO 257, INTERCEPT 258 and TANGO 28 1 . 

TABLE 1: Summary of Human TANGO 253, TANGO 257, INTERCEPT 258, and 
TANGO 281 Sequence Information 



Gene 


cDNA 


ORF 


Figure 


Accession 
Number 


TANGO 253 


SEQ ID NO:l 


SEQ ID NO:2 


Figure 1 


207222 


TANGO 257 


SEOIDNO:15 


SEQ ID NO: 16 


Figures 9A-9B 


207222 
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INTERCEPT 
258 


SEQIDNO:26 


SEQ1DN0:27 


Figure 17 


207222 


TANGO 281 


SEQ ID NO:46 


SEQ ID NO:47 


Figures 27 


207222 



5 



10 



15 



20 



25 



30 



35 
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Cytoplasmic 






aa 225-246 
of 

SEQID NO:28 
(SEQ ID NO:79) 


aa 149-245 
of 

SEQ ID NO:48 
(SEQ ID NO:53) 


Transmembrane 






aa 207-224 
of 

SEQIDNO:28 

(SEQ IDNO:78); 

aa 247-271 of 
SEQ ED NO: 28 

(SEQ ID NO: 33) 


aa 124-148 
of 

SEQIDNO:48 
(SEQ ID NO:52) 


Collagen 


aa 36-45 
of 

SEQIDNO:3 (SEQ 
IDNO:6) 








Clq 


aa 102-232 
of 

SEQIDNO:3 
(SEQ ID NO: 7) 














aa 49-128; 
167-226 of 

SEQID 

NO:28 

(SEQID 

NO:35; 

SEQ ID 1 

NO:36) 




PSBH 








aa 41-90; 12- 
187 

of 

SEQID 
NO:48 

(SEQID 
NO:54; SEQ 
ID NO:55) 


Extracellular 






aa 30-206 of 
SEQ ID NO: 28 
(SEQ ID NO: 
76) 

aa 272-370 of 
SEQ ID NO: 28 
(SEQ ID NO: 
34) 


aa 39-123 
of 

SEQIDNO:48 
(SEQIDNO:51) 


Mature Protein 


m do 
S 

2 OO 
™ w w 

00 w 


aa 22-406 
of 

SEQIDNO:17 
(SEQ ID NO: 18) 


aa 30-370 
of 

SEQIDNO:28 
(SEQIDNO:29) 


aa 39-245 
of 

SEQIDNO:48 
(SEQ ID NO:50) 


Signal 
Sequence 


aa 1-15 
of 

SEQIDNO:3 
(SEQ ED NO:5) 


aal-21 
of 

SEQIDNO:17 
(SEQ ID NO: 1 9) 


aa 1-29 
of 

SEQIDNO:28 
(SEQIDNO:30) 


aa 1-38 
of 

SEQIDNO:48 
(SEQIDNO:49) 


Protein 


TANGO 253 


TANGO 257 


INTERCEPT 
258 


TANGO 281 
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TABLE 3: Summary of Mouse TANGO 253, TANGO 257, INTERCEPT 258 and 
TANGO 281 Sequence Information 



Gene 


cDNA 


ORF 


Figure 


Accession 
Number 


TANGO 253 


SEQ ID NO:8 


SEQ ID NO:9 


Figures 3A-3B 


207215 


TANGO 257 


SEQIDNO:21 


SEQIDNO:22 


Figures 11A-11B 


207217 


[NTERCEPT 
258 


SEQIDNO:37 


SEQ ID NO:38 


Figures 20A-20B 


207221 


TANGO 281 


SEQIDNO:56 


SEQ ID NO:57 


Figures 3 1A-3 IB 


PTA-224 
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Various aspects of the invention are described in further detail in the following subsections: 
I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
a polypeptide of the invention or a biologically active portion thereof, as well as nucleic 
acid molecules sufficient for use as hybridization probes to identify nucleic acid molecules 
encoding a polypeptide of the invention and fragments of such nucleic acid molecules 
suitable for use as PCR primers for the amplification or mutation of nucleic acid 
molecules. As used herein, the term "nucleic acid molecule" is intended to include DNA 
molecules (e.g., cDNA or genomic DNA) and RNA molecules (eg., mRNA) and analogs 
of the DNA or RNA generated using nucleotide analogs. The nucleic acid molecule can 
be single-stranded or double-stranded, but preferably is double-stranded DNA. In one 
embodiment, the nucleic acid molecules of the invention comprise a contiguous open 
reading frame encoding a polypeptide of the invention. 

15 An "isolated" nucleic acid molecule is one which is separated from other nucleic 

acid molecules which are present in the natural source of the nucleic acid molecule. 
Preferably, an "isolated" nucleic acid molecule is free of sequences (preferably protein 
encoding sequences) which naturally flank the nucleic acid (i.e., sequences located at the 
5' and 3' ends of the nucleic acid) in the genomic DNA of the organism from which the 

20 nucleic acid is derived. For example, in various embodiments, the isolated nucleic acid 
molecule can contain less than about 5 kB, 4 kB, 3 kB, 2 kB, 1 kB, 0.5 kB or 0.1 kB of 
nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA of 
the cell from which the nucleic acid is derived. Moreover, an "isolated" nucleic acid 
molecule, such as a cDNA molecule, can be substantially free of other cellular material, or 

25 culture medium when produced by recombinant techniques, or substantially free of 

chemical precursors or other chemicals when chemically synthesized. As used herein, the 
term "isolated" when referring to a nucleic acid molecule does not include an isolated 
chromosome. 

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 
30 having the nucleotide sequence of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 
47, 56, 57, 77, 80,91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 
125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 
161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 
181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, or a complement thereof, 
35 can be isolated using standard molecular biology techniques and the sequence information 
provided herein. Using all or a portion of the nucleic acid sequences of SEQ ID NO:l, 2, 
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8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 105, 107, 109, 
111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 
147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 
174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 
5 or 192, as a hybridization probe, nucleic acid molecules of the invention can be isolated 
using standard hybridization and cloning techniques (e.g., as described in Sambrook et al., 
eds., Molecular Cloning: A Laboratory Manual, 2nd ed f Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989). 

A nucleic acid molecule of the invention can be amplified using cDNA, mRNA or 
10 genomic DNA as a template and appropriate oligonucleotide primers according to 

standard PGR amplification techniques. The nucleic acid so amplified can be cloned into 
an appropriate vector and characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to all or a portion of a nucleic acid molecule of the 
invention can be prepared by standard synthetic techniques, e.g., using an automated DNA 
15 synthesizer. 

In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which is a complement of the nucleotide 
sequence of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 

2Q 91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 
133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 
167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 
185, 186, 187, 188, 189, 190, 191 or 192, or the nucleotide sequence of the cDNA insert 
of a clone deposited with the ATCC® as Accession number 207222, Accession Number 

25 2072 15, Accession number 2072 1 7, Accession Number 20722 1 or patent deposit Number 
PTA-224, or a portion thereof A nucleic acid molecule which is complementary to a 
given nucleotide sequence is one which is sufficiently complementary to the given 
nucleotide sequence that it can hybridize to the given nucleotide sequence thereby forming 
a stable duplex. 

30 Moreover, a nucleic acid molecule of the invention can comprise only a portion of 

a nucleic acid sequence encoding a full length polypeptide of the invention for example, a 
fragment which can be used as a probe or primer or a fragment encoding a biologically 
active portion of a polypeptide of the invention. The nucleotide sequence determined from 
the cloning one gene allows for the generation of probes and primers designed for use in 

35 identifying and/or cloning homologues in other cell types, e.g y from other tissues, as well 
as homologues from other mammals. The probe/primer typically comprises substantially 
purified oligonucleotide. In one embodiment, the oligonucleotide comprises a region of 
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nucleotide sequence that hybridizes under stringent conditions to at least about 12, 
preferably 25, more preferably about 50, 75, 100, 125, 150, 175, 200, 250, 300, 350 or 400 
consecutive oligonucleotides of the sense or anti-sense sequence of SEQ ID NO: 1, 2, 8, 9, 
15, 16,21,22, 26,27, 37,38,46, 47,56, 57,77, 80,91, 100, 101, 103, 105, 107, 109, 111, 

5 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 
149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 
175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 
192, or the nucleotide sequence of the cDNA insert of a clone deposited with the ATCC® 
as Accession number 207222, Accession Number 207215, Accession Number 207217, 

1 0 Accession Number 20722 1 , or patent deposit Number PTA-224, or of a naturally 

occurring mutant of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 
77, 80,91, 100, 101, 103, 104, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 
129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 
165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 

15 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192. In another embodiment, the 

oligonucleotide comprises a region of nucleotide sequence that hybridizes under stringent 
conditions to at least 400, preferably 450, 500, 530, 550, 600, 700, 750, 800, 850, 900, 
1000, 1 100, 1200 or more consecutive oligonucleotides of the sense of antisense sequence 
of SED ID NO: 1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 

20 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 
137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 
169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 
187, 188, 189, 190, 191 or 192 , or the nucleotide sequence of the cDNA insert of a clone 
deposited with the ATCC® as Accession number 207222, Accession number 207215, 

25 Accession number 2072 1 7, Accession Number 20722 1 , or patent deposit Number PTA- 
224, or of a naturally occurring mutant of SEQ ID NO: 1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 
37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 
121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 
157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 

30 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192. 

In a preferred embodiment, the oligonucleotide typically comprises a region of 
nucleotide sequence that hybridizes under stringent conditions to at least about 450, 
preferably about 500, 550, 600, 650, 700, 750, 800, 850, 900, 1 000, 1 1 00 or 1 300 
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:l, 103, 105, 
35 1 07 or 1 09, or a naturally occurring mutant of SEQ ID NO: 1 , 1 03, 105, 1 07, or 109. In 
another preferred embodiment, the oligonucleotide typically comprises a region of 
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nucleotide sequence that hybridizes under stringent conditions to at least about 12, 
preferably 25, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700 or 720 
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:2, 91 , 100, 101 
or 80. 

^ In another preferred embodiment, the oligonucleotide typically comprises a region 

of nucleotide sequence that hybridizes under stringent conditions to at least about 540, 
preferably about 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1 100, 1200 or 1250 
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:8, 1 19, 121, 
123 or 125, or of a naturally occurring mutant of SEQ ED NO:8, 1 19, 121, 123 or 125. In 

1 0 another preferred embodiment, the oligonucleotide typically comprises a region of 
nucleotide sequence that hybridizes under stringent conditions to at least about 3 1 0, 
preferably about 350, 400, 450, 500, 550, 600, 650 or 700 consecutive nucleotides of the 
sense or anti-sense sequence of SEQ ID NO:9, 1 74, 1 75, 1 76 or 1 77, or of a naturally 
occurring mutant of SEQ ID NO:9, 174, 175, 176 or 177. 

15 

In another preferred embodiment, the oligonucleotide typically comprises a region 
of nucleotide sequence that hybridizes under stringent conditions to at least about 1800 
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO: 1 5, 1 1 1 , 1 1 3, 
115 or 117, or ofanaturally occurring mutant of SEQ ID NO:15, 111, 113, 115 or 117. In 
another preferred embodiment, the oligonucleotide typically comprises a region of 
nucleotide sequence that hybridizes under stringent conditions to at least about 1 150 
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:16, 170, 171, 
172 or 173, or of a naturally occurring mutant of SEQ ID NO:16, 170, 171, 172 or 173. 

In another preferred embodiment, the oligonucleotide typically comprises a region 
25 of nucleotide sequence that hybridizes under stringent conditions to at least about 1 1 00, 
preferably about 1200, 1300, 1400, 1500, 16500 or 1700 consecutive nucleotides of the 
sense or anti-sense sequence of SEQ ID NO:21, 127, 129, 131 or 133, or of a naturally 
occurring mutant of SEQ ID NO:21, 127, 129, 131 or 133. In another preferred 
embodiment, the oligonucleotide typically comprises a region of nucleotide sequence that 
30 hybridizes under stringent conditions to at least about 1 1 50 consecutive nucleotides of the 
sense or anti-sense sequence of SEQ ID NO:22, 1 78, 1 79, 1 80 or 1 8 1 , or of a naturally 
occurring mutant of SEQ ID NO:22, 178, 179, 180 or 181. 

In another preferred embodiment, the oligonucleotide typically comprises a region 
of nucleotide sequence that hybridizes under stringent conditions to at least about 420, 
35 preferably about 450, 500, 600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 1600, 
1700 or 1800 consecutive nucleotides of the sense or anti-sense sequence of SEQ ID 
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NO:26, 135, 137, 139 or 141, or of a naturally occurring mutant of SEQ ID NO:26, 135, 
137, 139 or 141 . In another preferred embodiment, the oligonucleotide typically 
comprises a region of nucleotide sequence that hybridizes under stringent conditions to at 
least about 12, preferably about 25, more preferably about 50, 100, 200, 300, 400, 500, 
5 600, 700, 800, 900, 1 000, 1 1 00, 1 200, 1 300, 1 400, 1 500, 1 600, 1 700 or 1 800 consecutive 
nucleotides of the sense or anti-sense sequence of SEQ ID NO:27, 182, 183, 184 or 185, 
or of a naturally occurring mutant of SEQ ID NO:27, 182, 183, 184 or 185. 

In another preferred embodiment, the oligonucleotide typically comprises a region 
of nucleotide sequence that hybridizes under stringent conditions to at least about 675, 

10 preferably about 700, 800, 900, 1000, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 

consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:37, 143, 145, 
147 or 149, or of a naturally occurring mutant of SEQ ID NO:37, 143, 145, 147 or 149. In 
another preferred embodiment, the oligonucleotide typically comprises a region of 
nucleotide sequence that hybridizes under stringent conditions to at least about 500, 

15 preferably about 600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 1600, 1700 or 
1 800 consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:38, 1 86, 
187, 188 or 189, or of a naturally occurring mutant of SEQ ID NO:38, 186, 187, 188 or 
189. 



20 



25 



In another preferred embodiment, the oligonucleotide typically comprises a region 
of nucleotide sequence that hybridizes under stringent conditions to at least about 12, 
preferably about 25, more preferably about 50, 100, 200, 300, 400, 500, 600, 700, 800, 
900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 consecutive nucleotides of 
the sense or anti-sense sequence of SEQ ED NO:46, 151, 153, 155 or 157, or of a naturally 
occurring mutant of SEQ ID NO:46, 151, 153, 155 or 157. In another preferred 
embodiment, the oligonucleotide typically comprises a region of nucleotide sequence that 
hybridizes under stringent conditions to at least about 12, preferably about 25, more 
preferably about 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 
1400, 1500, 1600, 1700 or 1800 consecutive nucleotides of the sense or anti-sense 
sequence of SEQ ID NO:47, 190, 191, 192 or 77, or of a naturally occurring mutant of 
SEQ ID NO:47, 190, 191, 192 or 77. 

In another preferred embodiment, the oligonucleotide typically comprises a region 
of nucleotide sequence that hybridizes under stringent conditions to at least about 550, 
preferably about 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 
35 1 800 or 1 850 consecutive nucleotides of the sense or anti-sense sequence of SEQ ID 
NO:56, 159, 161, 163 or 165, or of a naturally occurring mutant of SEQ ID NO:56, 159, 
161, 163 or 165. In another preferred embodiment, the oligonucleotide typically 
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comprises a region of nucleotide sequence that hybridizes under stringent conditions to at 
least about 12, preferably about 25, more preferably about 50, 100, 200, 300, 400, 500, 
600 or 700 consecutive nucleotides of the sense or anti-sense sequence of SEQ ED NO:57, 
166, 167, 168 or 169, or of a naturally occurring mutant of SEQ ID NO:57, 166, 167, 168 
5 or 169. 

Probes based on the sequence of a nucleic acid molecule of the invention can be 
used to detect transcripts or genomic sequences encoding the same protein molecule 
encoded by a selected nucleic acid molecule. The probe comprises a label group attached 
thereto, e.g., a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. 
1 0 Such probes can be used as part of a diagnostic test kit for identifying cells or tissues 
which mis-express the protein, such as by measuring levels of a nucleic acid molecule 
encoding the protein in a sample of cells from a subject, e.g., detecting mRNA levels or 
determining whether a gene encoding the protein has been mutated or deleted. 

A nucleic acid fragment encoding a biologically active portion of a polypeptide of 
the invention can be prepared by isolating a portion of any of SEQ ED NO:3, 10, 17, 23, 
28, 39, 48, 58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 
132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164 
expressing the encoded portion of the polypeptide protein (e.g., by recombinant expression 
20 in vitro) and assessing the activity of the encoded portion of the polypeptide. 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequence of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 
77, 80,91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 
129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 

25 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 
183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, or the nucleotide sequence of the 
cDNA insert of a clone deposited with the ATCC® as Accession Number 207222, 
Accession Number 207215, Accession Number 207217, Accession Number 207221 or 
patent deposit number PTA-224 due to degeneracy of the genetic code and thus encode the 

30 same protein as that encoded by the nucleotide sequence of SEQ ID NO: 1 , 2, 8, 9, 15, 1 6, 
21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 105, 107, 109, 1 1 1, 113, 
115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 
151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 
176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191or 192, or 

35 the nucleotide sequence of the cDNA insert of a clone deposited with the ATCC® as 
Accession Number 207222, Accession Number 207215, Accession Number 207217, 
Accession Number 207221 or patent deposit Number PTA-224. 
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In addition to the nucleotide sequences of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 
27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 105, 107, 109, 1 1 1, 1 13, 1 15, 1 17, 
119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 
155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 
5 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191or 192, it will be 
appreciated by those skilled in the art that DNA sequence polymorphisms that lead to 
changes in the amino acid sequence may exist within a population (e.g. , the human 
population). Such genetic polymorphisms may exist among individuals within a 
population due to natural allelic variation. 

* 0 An allele is one of a group of genes which occur alternatively at a given genetic 

locus. As used herein, the phrase "allelic variant" refers to a nucleotide sequence which 
occurs at a given locus or to a polypeptide encoded by the nucleotide sequence. As used 
herein, the terms "gene" and "recombinant gene" refer to nucleic acid molecules 
comprising an open reading frame encoding a polypeptide of the invention. Such natural 

1 5 allelic variations can typically result in 1 -5% variance in the nucleotide sequence of a 
given gene. Alternative alleles can be identified by sequencing the gene of interest in a 
number of different individuals. This can be readily carried out by using hybridization 
probes to identify the same genetic locus in a variety of individuals. Any and all such 
nucleotide variations and resulting amino acid polymorphisms or variations that are the 

20 result of natural allelic variation and that do not alter the functional activity are intended to 
be within the scope of the invention. 

The human gene for TANGO 253 has been mapped to the long arm of 
chromosome 1 1 . Flanking markers for this region are Dl 1 5 1 356 and D 1 1 5924. The 
Jacobsen syndrome (JBS), ED4 (ectodermal dysplasia 4), CMT4B (Charcot Marie Tooth 
neuropathy), PORC (porphyria, acute) loci also map to this region of the human 
chromosome. The APOPLP1 (apolipoprotein cluster), DRD2 (dopamine receptor 2), 
PGL1 (paraganglioma glomus tumors), RDX (radixin), NCAM1 (neural cell adhesion 
molecule), ARCN1 (archain 1), and EL-10R (IL-10 receptor) genes map to this region of 
the human chromosome. This region is syntenic to mouse chromosome 9. The ruf (rough 
fur), lu (luxoid), and atm (ataxia telangiectasia gene mutated in human being) loci also 
mpa to this region of the mouse chromosome. The ruf (rough fur), lu (luxoi), hmbs 
(hydroxymethylbilane synthase), IL-lORa (IL-10 receptor a), and drd2 (dopamine 
receptor 2) genes also map to this region of the mouse chromosome. 

35 The human gene for TANGO 257 has been mapped to chromosome 1 . Flanking 

markers for ths region are WI-7614 and FB14F9. The WS2B (Waardenburg syndrome) 
loci also maps to this region of the human chromosome. The NGF-P (nerve growth 
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factor-P), TSHB (thyroid stimulating hormone), and GSTM1 (glutathione S-transferase 
cluster) genes also map to this region of the human chromosome. This region is syntenic 
to mouse chromosome 3. The de (droopy ear) loci maps to this region of the mouse 
chromosome. The NGF-0 (nerve growth factor-p), TSHB (thyroid stimulating hormone), 
5 and BCAN (brevican) genes also map to this region of the mouse chromosome. 

The human gene for INTERCEPT 258 has been mapped to the long arm of 
chromosome 1 1, in the region q23. Flanking markers for this region are Dl 1S936 and 
Dl 1S933. The CMT4B (Charcot Marie Tooth neuropathy), ED4 (ecotodermal dysplasia), 
JBS (Jacobsen Syndrome), and TCPT (thrombocytopenia) loci also map to this region of 

10 the human chromosome. The APOLP1 (apolipoprotein cluster), DRD2 (dopamine 
receptor), and RDX (radixin) genes also map to this region of the human chromosome. 
This region is syntenic to mouse chromosome 9. The atm (ataxia telangiectasia), ruf 
(rough fur), and vs (variable spotting) loci map to this region of the mouse chromosome. 
The lu (luxoid), vs (variable spotting), atm (ataxia telangiectasia), rug (rough fur), and 

15 lapl (leucine arylaminopeptidase) genes also map to this region of the mouse 
chromosome. 

Moreover, nucleic acid molecules encoding proteins of the invention from other 
species (homologues), which have a nucleotide sequence which differs from that of the 
2Q human or mouse protein described herein are intended to be within the scope of the 
invention. Nucleic acid molecules corresponding to natural allelic variants and 
homologues of a cDNA of the invention can be isolated based on their identity to the 
human nucleic acid molecule disclosed herein using the human cDNAs, or a portion 
thereof, as a hybridization probe according to standard hybridization techniques under 
stringent hybridization conditions. For example, a cDNA encoding a soluble form of a 
membrane-bound protein of the invention isolated based on its hybridization to a nucleic 
acid molecule encoding all or part of the membrane-bound form. Likewise, a cDNA 
encoding a membrane-bound form can be isolated based on its hybridization to a nucleic 
acid molecule encoding all or part of the soluble form. 

30 Accordingly, in another embodiment, an isolated nucleic acid molecule of the 

invention is at least 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 1000, 1 100, 1200 or 
1300 contiguous nucleotides in length and hybridizes under stringent conditions to the 
nucleic acid molecule comprising the nucleotide sequence, preferably the coding 
sequence, of SEQ ID NO:l, 103, 105, 107 or 109, or a complement thereof. 

35 Accordingly, in another embodiment, an isolated nucleic acid molecule of the 

invention is at least 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700 or 
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720 contiguous nucleotides in length and hybridizes under stringent conditions to the 
nucleic acid molecule comprising the nucleotide sequence, preferably the coding 
sequence, of SEQ ID NO:2, 80, 91, 100 or 101, or a complement thereof. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
5 invention is at least 540, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1 100, 1200 or 1250 
contiguous nucleotides in length and hybridizes under stringent conditions to the nucleic 
acid molecule comprising the nucleotide sequence, preferably the coding sequence, of 
SEQ ID NO:8, 1 19, 121, 123 or 125, or a complement thereof. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 310, 350, 400, 450, 500, 550, 600, 650 or 700 contiguous nucleotides 
in length and hybridizes under stringent conditions to the nucleic acid molecule 
comprising the nucleotide sequence, preferably the coding sequence, of SEQ ID NO:9, 
174, 175, 176 or 177, or a complement thereof. 

15 Accordingly, in another embodiment, an isolated nucleic acid molecule of the 

invention is at least 1800 contiguous nucleotides in length and hybridizes under stringent 
conditions to the nucleic acid molecule comprising the nucleotide sequence, preferably the 
coding sequence, of SEQ ID NO: 1 5, 1 1 1 , 1 1 3, 1 1 5 or 1 1 7, or a complement thereof. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
20 invention is at least 1 150 or 1200 contiguous nucleotides in length and hybridizes under 
stringent conditions to the nucleic acid molecule comprising the nucleotide sequence, 
preferably the coding sequence, of SEQ ID NO: 16, 170, 171, 172 or 173, or a complement 
thereof. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
25 invention is at least 1100, 1200, 1300, 1400, 1500, 1600 or 1700 contiguous nucleotides 
in length and hybridizes under stringent conditions to the nucleic acid molecule 
comprising the nucleotide sequence, preferably the coding sequence, of SEQ ID NO:21, 
127, 129, 131 or 133, or a complement thereof. 

3 q Accordingly, in another embodiment, an isolated nucleic acid molecule of the 

invention is at least 1 150 or 1200 contiguous nucleotides in length and hybridizes under 
stringent conditions to the nucleic acid molecule comprising the nucleotide sequence, 
preferably the coding sequence, of SEQ ID NO:22, 1 78, 1 79, 1 80 or 1 8 1 , or a complement 
thereof 

35 Accordingly, in another embodiment, an isolated nucleic acid molecule of the 

invention is at least 420, 450, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 
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1500, 1600, 1700, or 1800 contiguous nucleotides in length and hybridizes under stringent 
conditions to the nucleic acid molecule comprising the nucleotide sequence, preferably the 
coding sequence, of SEQ ID No:26, 135, 137, 139 or 141, or a complement thereof. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
5 invention is at least 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1 100, 1200, 
1300, 1400, 1500, 1600, 1700 or 1800 contiguous nucleotides in length and hybridizes 
under stringent conditions to the nucleic acid molecule comprising the nucleotide 
sequence, preferably the coding sequence, of SEQ ID NO:27, 182, 183, 184 or 185, or a 
complement thereof. 

10 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 675, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 1600, 1700 
or 1800 contiguous nucleotides in length and hybridizes under stringent conditions to the 
nucleic acid molecule comprising the nucleotide sequence, preferably the coding 
sequence, of SEQ ID NO:37, 143, 145, 147 or 149, or a complement thereof. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 500, 600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 1600, 
1700 or 1800 contiguous nucleotides in length and hybridizes under stringent conditions 
to the nucleic acid molecule comprising the nucleotide sequence, preferably the coding 
20 sequence, of SEQ ID NO:38, 1 86, 1 87, 1 88 or 1 89, or a complement thereof. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1 100, 1200, 
1300, 1400, 1500, 1600, 1700 or 1800 contiguous nucleotides in length and hybridizes 
under stringent conditions to the nucleic acid molecule comprising the nucleotide 
25 sequence, preferably the coding sequence, of SEQ ID NO:46, 1 5 1 , 1 53, 1 55 or 1 57, or a 
complement thereof. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 50, 100, 200, 300, 400, 500, 600, 700 or 750 contiguous nucleotides in 
length and hybridizes under stringent conditions to the nucleic acid molecule comprising 
the nucleotide sequence, preferably the coding sequence, of SEQ ID NO:47, 77 190, 191 
or 192, or a complement thereof. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 550, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 
35 1 700, 1 800 or 1 850 contiguous nucleotides in length and hybridizes under stringent 

conditions to the nucleic acid molecule comprising the nucleotide sequence, preferably the 
coding sequence, of SEQ ID NO:56, 159, 161, 163 or 165, or a complement thereof. 
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Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 50, 100, 200, 300, 400, 500, 600 or 700 contiguous nucleotides in 
length and hybridizes under stringent conditions to the nucleic acid molecule comprising 
the nucleotide sequence, preferably the coding sequence, of SEQ ED NO:57, 166, 167, 168 
or 169, or a complement thereof. 

As used herein, the term "hybridizes under stringent conditions" is intended to 
describe conditions for hybridization and washing under which nucleotide sequences at 
least 60%, 65%, 70%, preferably 75%, identical to each other typically remain hybridized 
to each other. Such stringent conditions are known to those skilled in the art and can be 
found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3.1-6.3.6. A preferred, non-limiting example of stringent hybridization conditions are 
hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45° C, followed by one 
or more washes in 0.2 X SSC, 0.1% SDS at 50-65° C. Preferably, an isolated nucleic acid 
molecule of the invention that hybridizes under stringent conditions to the sequence of 
15 SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 
103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 
139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 
170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 
188, 189, 190, 191 or 192 , or a complement thereof, corresponds to a naturally-occurring 
nucleic acid molecule. As used herein, a "naturally-occurring" nucleic acid molecule 
refers to an RNA or DNA molecule having a nucleotide sequence that occurs in nature 
(e.g., encodes a natural protein). 

In addition to naturally-occurring allelic variants of a nucleic acid molecule of the 
invention sequence that may exist in the population, the skilled artisan will further 
appreciate that changes can be introduced by mutation thereby leading to changes in the 
amino acid sequence of the encoded protein, without altering the biological activity of the 
protein. For example, one can make nucleotide substitutions leading to amino acid 
substitutions at "non-essential" amino acid residues. A "non-essential" amino acid residue 
is a residue that can be altered from the wild-type sequence without altering the biological 
activity, whereas an "essential" amino acid residue is required for biological activity. For 
example, amino acid residues that are not conserved or only semi-conserved among 
homologues of various species may be non-essential for activity and thus would be likely 
targets for alteration. Specific examples of conservative amino acid alterations from the 
original amino acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48 or 58 are shown in 
SEQ ID NO: 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 
132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164. 
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Alternatively, amino acid residues that are conserved among the homologies of various 
species {e.g., mouse and human) may be essential for activity and thus would not be likely 
targets for alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
5 encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
ID NO:3, 102, 104, 106 or 108, yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 40%, 45%, 50%, 55%, 60%, 65%, 
10 75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID NO:3, 102, 104, 
106 or 108. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
ED NO: 10, 1 18, 120, 122 or 124 yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 95%, or 98% identical to the amino 
acid sequence of SEQ ID NO:10, 118, 120, 122 or 124. 

20 Accordingly, another aspect of the invention pertains to nucleic acid molecules 

encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
ID NO: 17, 1 10, 1 12, 1 14 or 1 16 yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 

25 includes an amino acid sequence that is at least about 88%, 90%, 95% or 98% identical to 
the amino acid sequence of SEQ ED NO:17, 1 10, 1 12, 1 14 or 116. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
30 ID NO:23, 126, 128, 130 or 132 yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 88%, 90%, 95%,* or 98% identical to 
the amino acid sequence of SEQ ID NO:23, 126, 128, 130 or 132. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 

35 

encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
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ID NO:28, 134, 136, 138, 140, yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 45%, 50%, 55%, 60%, 65%, 75%, 
85%, 95%, or 98% identical to the amino acid sequence of SEQ ID NO:28, 134, 136, 138, 
5 140. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
ID NO:39, 142, 144, 146 or 148, yet retain biological activity. In one embodiment, the 
I® isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 45%, 50%, 55%, 60%, 65%, 75%, 
85%, 95%, or 98% identical to the amino acid sequence of SEQ ID NO:39, 142, 144, 146 
or 148. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
ID NO:48, 150, 152, 154, or 156, yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 
2Q includes an amino acid sequence that is at least about 30%, 35%, 40%, 45%, 50%, 55%, 
60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID 
NO:48, 150, 152, 154 or 156. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 

25 are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
ID NO:58, 158, 160, 162 or 164, yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 30%, 35%, 40%, 45%, 50%, 55%, 
60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID 

30 NO:58, 158, 160, 162 or 164. 

An isolated nucleic acid molecule encoding a variant protein can be created by 
introducing one or more nucleotide substitutions, additions or deletions into the nucleotide 
sequence of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, . 
91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 
35 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 
167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 
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185, 186, 187, 188, 189, 190, 191 or 192 such that one or more amino acid substitutions, 
additions or deletions are introduced into the encoded protein. Mutations can be 
introduced by standard techniques, such as site-directed mutagenesis and PCR-mediated 
mutagenesis. Preferably, conservative amino acid substitutions are made at one or more 

5 predicted non-essential amino acid residues. A "conservative amino acid substitution" is 
one in which the amino acid residue is replaced with an amino acid residue having a 
similar side chain. Families of amino acid residues having similar side chains have been 
defined in the art. These families include amino acids with basic side chains {e.g., lysine, 
arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar 

1 0 side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), 
nonpolar side chains {e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, 
methionine, tryptophan), beta-branched side chains {e.g., threonine, valine, isoleucine) and 
aromatic side chains {e.g., tyrosine, phenylalanine, tryptophan, histidine). Alternatively, 
mutations can be introduced randomly along all or part of the coding sequence, such as by 

15 saturation mutagenesis, and the resultant mutants can be screened for biological activity to 
identify mutants that retain activity. Following mutagenesis, the encoded protein can be 
expressed recombinantly and the activity of the protein can be determined. 

In a preferred embodiment, a mutant polypeptide that is a variant of a polypeptide 
of the invention can be assayed for: (1) the ability to form protein: protein interactions 
20 with proteins in a signaling pathway of the polypeptide of the invention; (2) the ability to 
bind a ligand of the polypeptide of the invention; or (3) the ability to bind to an 
intracellular target protein of the polypeptide of the invention. In yet another preferred 
embodiment, the mutant polypeptide can be assayed for the ability to modulate cellular 
proliferation, cellular migration or chemotaxis, or cellular differentiation. 

25 

The present invention encompasses antisense nucleic acid molecules, i.e., 
molecules which are complementary to a sense nucleic acid encoding a polypeptide of the 
invention, e.g., complementary to the coding strand of a double-stranded cDNA molecule 
or complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can 
JO k^ 08611 k° n< * t0 a sense nucleic acid. The antisense nucleic acid can be complementary 
to an entire coding strand, or to only a portion thereof, e.g., all or part of the protein 
coding region (or open reading frame). An antisense nucleic acid molecule can be 
antisense to all or part of a non-coding region of the coding strand of a nucleotide 
sequence encoding a polypeptide of the invention. The non-coding regions ("5 1 and 3 1 
untranslated regions") are the 5' and 3 1 sequences which flank the coding region and are 
not translated into amino acids. 
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An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 
40, 45 or 50 nucleotides or more in length. An antisense nucleic acid of the invention can 
be constructed using chemical synthesis and enzymatic ligation reactions using procedures 
known in the art. For example, an antisense nucleic acid (e.g., an antisense 

5 oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or 
variously modified nucleotides designed to increase the biological stability of the 
molecules or to increase the physical stability of the duplex formed between the antisense 
and sense nucleic acids, e.g., phosphorothioate derivatives and acridine substituted 
nucleotides can be used. Examples of modified nucleotides which can be used to generate 

10 the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5-chIorouracil, 

5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 
5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 

1- methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 

15 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 
5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5 f -methoxycarboxymethyluracil, 5-methoxyuracil, 

2- methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, 
pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 

20 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 
2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically 
using an expression vector into which a nucleic acid has been subcloned in an antisense 
orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense 

25 orientation to a target nucleic acid of interest, described further in the following 
subsection). 

The antisense nucleic acid molecules of the invention are typically administered to 
a subject or generated in situ such that they hybridize with or bind to cellular mRNA , 
and/or genomic DNA encoding a selected polypeptide of the invention to thereby inhibit 

30 expression, e.g., by inhibiting transcription and/or translation. The hybridization can be 
by conventional nucleotide complementarity to form a stable duplex, or, for example, in 
the case of an antisense nucleic acid molecule which binds to DNA duplexes, through 
specific interactions in the major groove of the double helix. An example of a route of 
administration of antisense nucleic acid molecules of the invention includes direct 

35 injection at a tissue site. Alternatively, antisense nucleic acid molecules can be modified 
to target selected cells and then administered systemically. For example, for systemic 
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administration, antisense molecules can be modified such that they specifically bind to 
receptors or antigens expressed on a selected cell surface, e.g., by linking the antisense 
nucleic acid molecules to peptides or antibodies which bind to cell surface receptors or 
antigens. The antisense nucleic acid molecules can also be delivered to cells using the 
vectors described herein. To achieve sufficient intracellular concentrations of the 
antisense molecules, vector constructs in which the antisense nucleic acid molecule is 
placed under the control of a strong pol II or pol III promoter are preferred. 

An antisense nucleic acid molecule of the invention can be an «-anomeric (alpha) 
nucleic acid molecule. An «-anomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual P-units, 
the strands run parallel to each other (Gaultier et al. ( 1 987) Nucleic Acids Res. 
15:6625-6641). The antisense nucleic acid molecule can also comprise a 
2'-o-methylribonucleotide (Inoue et al. (1987) Nucleic Acids Res. 15:6131-6148) or a 
chimeric RNA-DNA analogue (Inoue et al. (1987) FEBSLett. 215:327-330). 

The invention also encompasses ribozymes. Ribozymes are catalytic RNA 
molecules with ribonuclease activity which are capable of cleaving a single-stranded 
nucleic acid, such as an mRNA, to which they have a complementary region. Thus, 
ribozymes {e.g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988) 
Nature 334:585-591)) can be used to catalytically cleave mRNA transcripts to thereby 
inhibit translation of the protein encoded by the mRNA. A ribozyme having specificity 
for a nucleic acid molecule encoding a polypeptide of the invention can be designed based 
upon the nucleotide sequence of a cDNA disclosed herein. For example, a derivative of a 
Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the 
active site is complementary to the nucleotide sequence to be cleaved in a Cech et al. U.S. 
Patent No. 4,987,071; and Cech et al. U.S. Patent No. 5,1 16,742. Alternatively, an mRNA 
encoding a polypeptide of the invention can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules. See, e.g., Battel and 
Szostak(1993)Sc/e/ice 261:1411-1418. 

The invention also encompasses nucleic acid molecules which form triple helical 
structures. For example, expression of a polypeptide of the invention can be inhibited by 
targeting nucleotide sequences complementary to the regulatory region of the gene 
encoding the polypeptide {e.g., the promoter and/or enhancer) to form triple helical 
structures that prevent transcription of the gene in target cells. See generally Helene 
(1991) Anticancer Drug Des. 6(6):569-84; Helene (1992) Ann. NY. Acad. Sci. 660:27-36; 
and Maher (1992) Bioassays 14(12):807-15. 
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In various embodiments, the nucleic acid molecules of the invention can be 
modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., the 
stability, hybridization, or solubility of the molecule. For example, the deoxyribose 
phosphate backbone of the nucleic acids can be modified to generate peptide nucleic acids 

5 {see Hyrup et al. (1996) Bioorganic & Medicinal Chemistry 4(1): 5-23). As used herein, 
the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA 
mimics, in which the deoxyribose phosphate backbone is replaced by a pseudopeptide 
backbone and only the four natural nucleobases are retained. The neutral backbone of 
PNAs has been shown to allow for specific hybridization to DNA and RNA under 

10 conditions of low ionic strength. The synthesis of PNA oligomers can be performed using 
standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996), supra; 
Perry-O'Keefe et al. (1996) Proc. Natl Acad, Set USA 93: 14670-675. 

PNAs can be used in therapeutic and diagnostic applications. For example, PNAs 
can be used as antisense or antigene agents for sequence-specific modulation of gene 

1 5 expression by, e.g., inducing transcription or translation arrest or inhibiting replication. 
PNAs can also be used, e.g., in the analysis of single base pair mutations in a gene by, e.g., 
PNA directed PCR clamping; as artificial restriction enzymes when used in combination 
with other enzymes, e.g., SI nucleases (Hyrup (1996), supra; or as probes or primers for 
DNA sequence and hybridization (Hyrup (1996), supra; Perry-O'Keefe et al. (1996) Proc. 

20 Natl. Acad. ScL USA 93: 14670-675). 

In another embodiment, PNAs can be modified, e.g., to enhance their stability or 
cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation of 
PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery 
known in the art. For example, PNA-DNA chimeras can be generated which may 
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, e.g., RNAse H and DNA polymerases, to interact with the DNA 
portion while the PNA portion would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms 

^ of base stacking, number of bonds between the nucleobases, and orientation (Hyrup 
(1996), supra). The synthesis of PNA-DNA chimeras can be performed as described in 
Hyrup (1996), supra, and Finn et al. (1996) Nucleic Acids Res. 24(17):3357-63. For 
example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite coupling chemistry and modified nucleoside analogs. Compounds such 

^ as 5 ^-methoxytrityOamino-S'-deoxy-thymidine phosphoramidite can be used as a link 
between the PNA and the 5' end of DNA (Mag et al. (1989) Nucleic Acids Res. 
17:5973-88). PNA monomers are then coupled in a stepwise manner to produce a 



-80- 



WO 00/78808 



PCT/US00/16883 



chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al. (1996) 
Nucleic Acids Res. 24(17):3357-63). Alternatively, chimeric molecules can be 
synthesized with a 5' DNA segment and a 3' PNA segment (Peterser et al. (1975) 
Bioorganic Med. Chem. Lett. 5:1119-11124). 

In other embodiments, the oligonucleotide may include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo ), or agents facilitating 
transport across the cell membrane (see, e.g., Letsinger et al. (1989) Proc. Natl. Acad. Sci. 
USA 86:6553-6556; Lemaitre et al. (1987) Proc. Natl. Acad. Sci. USA 84:648-652; PCT 
Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT Publication No. 
W0 89/10134). In addition, oligonucleotides can be modified with hybridization-triggered 
cleavage agents (see, e.g., Krol et al. (1988) Bio/Techniques 6:958-976) or intercalating 
agents (see, e.g., Zon (1988) Pharm. Res. 5:539-549). To this end, the oligonucleotide 
may be conjugated to another molecule, e.g., a peptide, hybridization triggered 
cross-linking agent, transport agent, hybridization-triggered cleavage agent, etc. 



II. Isolated Proteins and Antibodies 

One aspect of the invention pertains to isolated proteins, and biologically active 
portions thereof, as well as polypeptide fragments suitable for use as immunogens to raise 

20 antibodies directed against a polypeptide of the invention. In one embodiment, the native 
polypeptide can be isolated from cells or tissue sources by an appropriate purification 
scheme using standard protein purification techniques. In another embodiment, 
polypeptides of the invention are produced by recombinant DNA techniques. Alternative 
to recombinant expression, ^polypeptide of the invention can be synthesized chemically 

25 using standard peptide synthesis techniques. 

An "isolated" or "purified" protein or biologically active portion thereof is 
substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the protein is derived, or substantially free of chemical 

^ precursors or other chemicals when chemically synthesized. The language "substantially 
free of cellular material" includes preparations of protein in which the protein is separated 
from cellular components of the cells from which it is isolated or recombinantly produced. 
Thus, protein that is substantially free of cellular material includes preparations of protein 
having less than about 30%, 20%, 10%, or 5% (by dry weight) of heterologous protein 

^ (also referred to herein as a "contaminating protein"). When the protein or biologically 
active portion thereof is recombinantly produced, it is also preferably substantially free of 
culture medium, i.e., culture medium represents less than about 20%, 10%, or 5% of the 
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volume of the protein preparation. When the protein is produced by chemical synthesis, it 
is preferably substantially free of chemical precursors or other chemicals, i.e., it is 
separated from chemical precursors or other chemicals which are involved in the synthesis 
of the protein. Accordingly such preparations of the protein have less than about 30%, 
5 20%, 1 0%, 5% (by dry weight) of chemical precursors or compounds other than the 
polypeptide of interest. 

Biologically active portions of a polypeptide of the invention include polypeptides 
comprising amino acid sequences sufficiently identical to or derived from the amino acid 
sequence of the protein (e.g., the amino acid sequence shown in any of SEQ ID NO:4, 6, 

10 7, 13, 14, 18, 23, 28, 33, 34, 35, 36, 39, 42, 44, 45, 48, 51, 52, 53, 54, 55, 58, 61, 62, 63, 
64, 65, 71, 76, 34, 78, 79, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 
or 99 which include fewer amino acids than the full length protein, and exhibit at least one 
activity of the corresponding full-length protein! Typically, biologically active portions 
comprise a domain or motif with at least one activity of the corresponding protein. A 

^ biologically active portion of a protein of the invention can be a polypeptide which is, for 
example, 10, 25, 50, 100 or more amino acids in length. Moreover, other biologically 
active portions, in which other regions of the protein are deleted, can be prepared by 
recombinant techniques and evaluated for one or more of the functional activities of the 
native form of a polypeptide of the invention. 

20 

Preferred polypeptides have the amino acid sequence of SEQ ID NO:4, 6, 7, 13, 
14, 18, 23, 28, 33, 34, 35, 36, 39, 42, 44, 45, 48, 51, 52, 53, 54, 55, 58, 61, 62, 63, 64, 65, 
71, 76, 34, 78, 79, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, or 99. 
Other useful proteins are substantially identical (e.g., at least about 45%, preferably 55%, 
25 65%, 75%, 85%, 95%, or 99%) to any of SEQ ID NO:4, 6, 7, 13, 14, 18, 23, 28, 33, 34, 
35, 36, 39, 42, 44, 45, 48, 51, 52, 53, 54, 55, 58, 61, 62, 63, 64, 65, 71, 76, 34, 78, 79, 81, 
82, 83, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, or 99 and retain the functional 
activity of the protein of the corresponding naturally-occurring protein yet differ in amino 
acid sequence due to natural allelic variation or mutagenesis. 

30 To determine the percent identity of two amino acid sequences or of two nucleic 

acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 
alignment with a second amino or nucleic acid sequence). The amino acid residues or 
nucleotides at corresponding amino acid positions or nucleotide positions are then 

35 compared. When a position in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in the second sequence, then the 
molecules are identical at that position. The percent identity between the two sequences is 
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a function of the number of identical positions shared by the sequences (i.e., % identity == 
# of identical positions/total # of positions (e.g., overlapping positions) x 100). In one 
embodiment, the two sequences are the same length. 

The determination of percent identity between two sequences can be accomplished 
using a mathematical algorithm. A preferred, non-limiting example of a mathematical 
algorithm utilized for the comparison of two sequences is the algorithm of Karlin and 
Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in Karlin and 
Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an algorithm is 
incorporated into the NBLAST and XBLAST programs of Altschul, et al. (1990) J. Mol 
Biol. 215:403-410. BLAST nucleotide searches can be performed with the NBLAST 
program, score = 100, wordlength = 12 to obtain nucleotide sequences homologous to a 
nucleic acid molecules of the invention. BLAST protein searches can be performed with 
the XBLAST program, score = 50, wordlength = 3 to obtain amino acid sequences 
homologous to a protein molecules of the invention. To obtain gapped alignments for 
comparison purposes, Gapped BLAST can be utilized as described in Altschul et al. 
(1997) Nucleic Acids Res. 25:3389-3402. Alternatively, PSI-Blast can be used to perform 
an iterated search which detects distant relationships between molecules (Id.). When 
utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the default parameters of the 
respective programs (e.g., XBLAST and NBLAST) can be used. See 
http://www.ncbi.nlm.nih.gov. 

Another preferred, non-limiting example of a mathematical algorithm utilized for 
the comparison of sequences is the algorithm of Myers and Miller, CABIOS (1989). Such 
an algorithm is incorporated into the ALIGN program (version 2.0) which is part of the 
CGC sequence alignment software package. When utilizing the ALIGN program for 
comparing amino acid sequences, a PAM120 weight residue table, a gap length penalty of 
12, and a gap penalty of 4 can be used. Additional algorithms for sequence analysis are 
known in the art and include ADVANCE and ADAM as described in Torellis and Robotti 
(1994) Comput. Appl. Biosci., 10:3-5; and FASTA described in Pearson and Lipman 
(1988) Proc. Natl. Acad. Sci. 55:2444-8. Within FASTA, ktup is a control option that sets 
the sensitivity and speed of the search. If ktup=2, similar regions in the two sequences 
being compared are found by looking at pairs of aligned residues; if ktup=l, single aligned 
amino acids are examined, ktup can be set to 2 or 1 for protein sequences, or from 1 to 6 
for DNA sequences. The default if ktup is not specified is 2 for proteins and 6 for DNA. 
For a further description of FASTA parameters, see 

http://bioweb.pasteur.fr/docs/man/man/fasta. I.html#sect2, the contents of which are 
incorporated herein by reference. 
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The percent identity between two sequences can be determined using techniques 
similar to those described above, with or without allowing gaps. In calculating percent 
identity, only exact matches are counted. 

The invention also provides chimeric or fusion proteins. As used herein, a 
5 "chimeric protein" or "fusion protein" comprises all or part (preferably biologically active) 
of a polypeptide of the invention operably linked to a heterologous polypeptide (i.e., a 
polypeptide other than the same polypeptide of the invention). Within the fusion protein, 
the term "operably linked" is intended to indicate that the polypeptide of the invention and 
the heterologous polypeptide are fused in-frame to each other. The heterologous 
1 0 polypeptide can be fused to the N-terminus or C-terminus of the polypeptide of the 
invention. 

One useful fusion protein is a GST fusion protein in which the polypeptide of the 
invention is fused to the C-terminus of GST sequences. Such fusion proteins can facilitate 
the purification of a recombinant polypeptide of the invention. 

In another embodiment, the fusion protein contains a heterologous signal sequence 
at its N-terminus. For example, the native signal sequence of a polypeptide of the 
invention can be removed and replaced with a signal sequence from another protein. For 
example, the gp67 secretory sequence of the baculovirus envelope protein can be used as a 

20 heterologous signal sequence {Current Protocols in Molecular Biology, Ausubel et al., 
eds., John Wiley & Sons, 1992). Other examples of eukaryotic heterologous signal 
sequences include the secretory sequences of melittin and human placental alkaline 
phosphatase (Stratagene; La Jolla, California). In yet another example, useful prokaryotic 
heterologous signal sequences include the phoA secretory signal (Sambrook et al., supra) 

25 and the protein A secretory signal (Pharmacia Biotech; Piscataway, New Jersey). 

In yet another embodiment, the fusion protein is an immunoglobulin fusion protein 
in which all or part of a polypeptide of the invention is fused to sequences derived from a 
member of the immunoglobulin protein family. The immunoglobulin fusion proteins of 
the invention can be incorporated into pharmaceutical compositions and administered to a 

30 subject to inhibit an interaction between a ligand (soluble or membrane-bound) and a 
protein on the surface of a cell (receptor), to thereby suppress signal transduction in vivo. 
The immunoglobulin fusion protein can be used to affect the bioavailability of a cognate 
ligand of a polypeptide of the invention. Inhibition of ligand/receptor interaction may be 
useful therapeutically, both for treating proliferative and differentiate disorders and for 

35 modulating (eg., promoting or inhibiting) cell survival Moreover, the immunoglobulin 
fusion proteins of the invention can be used as immunogens to produce antibodies directed 
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against a polypeptide of the invention in a subject, to purify ligands and in screening 
assays to identify molecules which inhibit the interaction of receptors with ligands. 

Chimeric and fusion proteins of the invention can be produced by standard 
recombinant DNA techniques. In another embodiment, the fusion gene can be synthesized 
by conventional techniques including automated DNA synthesizers. Alternatively, PCR 
amplification of gene fragments can be carried out using anchor primers which give rise to 
complementary overhangs between two consecutive gene fragments which can 
subsequently be annealed and reamplified to generate a chimeric gene sequence (see, e.g., 
Ausubel et al., supra). Moreover, many expression vectors are commercially available 
that already encode a fusion moiety (e.g., 2l GST polypeptide). A nucleic acid encoding a 
polypeptide of the invention can be cloned into such an expression vector such that the 
fusion moiety is linked in-frame to the polypeptide of the invention. 

A signal sequence of a polypeptide of the invention (SEQ ID NO:5, 12, 19, 25, 30, 
41, 49 or 59) can be used to facilitate secretion and isolation of the secreted protein or 
other proteins of interest. Signal sequences are typically characterized by a core of 
hydrophobic amino acids which are generally cleaved from the mature protein during 
secretion in one or more cleavage events. Such signal peptides contain processing sites 
that allow cleavage of the signal sequence from the mature proteins as they pass through 
the secretory pathway. Thus, the invention pertains to the described polypeptides having a 
signal sequence, as well as to the signal sequence itself and to the polypeptide in the 
absence of the signal sequence (i.e.. the cleavage products). In one embodiment, a nucleic 
acid sequence encoding a signal sequence of the invention can be operably linked in an 
expression vector to a protein of interest, such as a protein which is ordinarily not secreted 
or is otherwise difficult to isolate. The signal sequence directs secretion of the protein, 
such as from a eukaryotic host into which the expression vector is transformed, and the 
signal sequence is subsequently or concurrently cleaved. The protein can then be readily 
purified from the extracellular medium by art recognized methods. Alternatively, the 
signal sequence can be linked to the protein of interest using a sequence which facilitates 
purification, such as with a GST domain. 

In another embodiment, the signal sequences of the present invention can be used 
to identify regulatory sequences, e.g. 9 promoters, enhancers, repressors. Since signal 
sequences are the most ammo-terminal sequences of a peptide, it is expected that the 
nucleic acids which flank the signal sequence on its amino-terminal side will be regulatory 
35 sequences which affect transcription. Thus, a nucleotide sequence which encodes all or a 
portion of a signal sequence can be used as a probe to identify and isolate signal sequences 
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and their flanking regions, and these flanking regions can be studied to identify regulatory 
elements therein. 

The present invention also pertains to variants of the polypeptides of the invention. 
Such variants have an altered amino acid sequence which can function as either agonists 

J (mimetics) or as antagonists. Variants can be generated by mutagenesis, e.g., discrete 
point mutation or truncation. An agonist can retain substantially the same, or a subset, of 
the biological activities of the naturally occurring form of the protein. An antagonist of a 
protein can inhibit one or more of the activities of the naturally occurring form of the 
protein by, for example, competitively binding to a downstream or upstream member of a 

1 0 cellular signaling cascade which includes the protein of interest. Thus, specific biological 
effects can be elicited by treatment with a variant of limited function. Treatment of a 
subject with a variant having a subset of the biological activities of the naturally occurring 
form of the protein can have fewer side effects in a subject relative to treatment with the 
naturally occurring form of the protein. 

Variants of a protein of the invention which function as either agonists (mimetics) 
or as antagonists can be identified by screening combinatorial libraries of mutants, e.g., 
truncation mutants, of the protein of the invention for agonist or antagonist activity. In 
one embodiment, a variegated library of variants is generated by combinatorial 
2o mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A 
variegated library of variants can be produced by, for example, enzymatically ligating a 
mixture of synthetic oligonucleotides into gene sequences such that a degenerate set of 
potential protein sequences is expressible as individual polypeptides, or alternatively, as a 
set of larger fusion proteins {e.g., for phage display). There are a variety of methods 
which can be used to produce libraries of potential variants of the polypeptides of the 
invention from a degenerate oligonucleotide sequence. Methods for synthesizing 
degenerate oligonucleotides are known in the art {see, e.g., Narang (1983) Tetrahedron 
39:3; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 
198:1056; Ike et al. (1983) Nucleic Acid Res. 1 1 :477). 

30 In addition, libraries of fragments of the coding sequence of a polypeptide of the 

invention can be used to generate a variegated population of polypeptides for screening 
and subsequent selection of variants. For example, a library of coding sequence fragments 
can be generated by treating a double stranded PCR fragment of the coding sequence of 
interest with a nuclease under conditions wherein nicking occurs only about once per 

35 molecule, denaturing the double stranded DNA, renaturing the DNA to form double 
stranded DNA which can include sense/antisense pairs from different nicked products, 
removing single stranded portions from reformed duplexes by treatment with SI nuclease, 
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and ligating the resulting fragment library into an expression vector. By this method, an 
expression library can be derived which encodes N-terminal and internal fragments of 
various sizes of the protein of interest. 

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. The most widely used techniques, 
which are amenable to high through-put analysis, for screening large gene libraries 
typically include cloning the gene library into replicable expression vectors, transforming 
appropriate cells with the resulting library of vectors, and expressing the combinatorial 
genes under conditions in which detection of a desired activity facilitates isolation of the 
vector encoding the gene whose product was detected. Recursive ensemble mutagenesis 
(REM), a technique which enhances the frequency of functional mutants in the libraries, 
can be used in combination with the screening assays to identify variants of a protein of 
the invention (Arkin and Yourvan (1992) Proc. Natl. Acad. Sci. USA 59:781 1-7815; 
Delgrave et al. (1993) Protein Engineering 6(3):327-331). 

The polypeptides of the invention can exhibit post-translational modifications, 
including, but not limited to glycosylates, (e.g., N-linked or O-linked glycosylations), 
myristylations, palmitylations, acetylations and phosphorylations (e.g., serine/threonine or 
tyrosine). In one embodiment, the TANGO 253, TANGO 257, INTERCEPT 258 or 
TANGO 281 polypeptides of the invention exhibit reduced levels of O-linked 
glycosylation and/or N-linked glycosylation relative to endogenously expressed TANGO 
253, TANGO 257, INTERCEPT 258 or TANGO 281 polypeptides of the invention do not 
exhibit O-linked glycosylation or N-linked glycosylation. The post-translational 
modifications of TANGO 253, TANGO 257, INTERCEPT 258 or TANGO 281 
polypeptides will vary depending upon the host cell in which in TANGO 253, TANGO 
257, INTERCEPT 258 or TANGO 28 1 is expressed. Further, post-translational 
modifications of TANGO 253, TANGO 257, INTERCEPT 258 or TANGO 281 
polypeptides such as glycosylation can be prevented by treating cells, e.g., with 
tunicamycin. 

An isolated polypeptide of the invention, or a fragment thereof, can be used as an 
immunogen to generate antibodies using standard techniques for polyclonal and 
monoclonal antibody preparation. The full-length polypeptide or protein can be used or, 
alternatively, the invention provides antigenic peptide fragments for use as immunogens. 
In one embodiment, an isolated polypeptide or fragment thereof which lacks N- and/or O- 
linked glycosylation is used as an immunogen to generate antibodies using standard 
techniques known to those of skill in the art. The antigenic peptide of a protein of the 
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invention comprises at least 8 (preferably 10, 15, 20, or 30) amino acid residues of the 
amino acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 110, 
112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 
148, 150, 152, 154, 156, 158, 160, 162 or 164 and encompasses an epitope of the protein 
5 such that an antibody raised against the peptide forms a specific immune complex with the 
protein. 

Preferred epitopes encompassed by the antigenic peptide are regions that are 
located on the surface of the protein, e.g., hydrophilic regions. Figures 2, 4, 10, 12, 19, 21, 
29 and 32, are hydropathy plots of the proteins of the invention. These plots or similar 
*® analyses can be used to identify hydrophilic regions. 

An immunogen typically is used to prepare antibodies by immunizing a suitable 
subject, (e.g., rabbit, goat, mouse or other mammal). An appropriate immunogenic 
preparation can contain, for example, recombinantly expressed or chemically synthesized 
j ^ polypeptide. The preparation can further include an adjuvant, such as Freund's complete 
or incomplete adjuvant, or similar immunostimulatory agent. 

Accordingly, another aspect of the invention pertains to antibodies directed 
against a polypeptide of the invention. The term "antibody" as used herein refers to 
immunoglobulin molecules and immunologically active portions of immunoglobulin 

20 molecules, i.e., molecules that contain an antigen binding site which specifically binds an 
antigen, such as a polypeptide of the invention e.g., an epitope of a polypeptide of the 
invention. A molecule which specifically binds to a given polypeptide of the invention is 
a molecule which binds the polypeptide, but does not substantially bind other molecules in 
a sample, e.g., a biological sample, which naturally contains the polypeptide. Examples of 

25 immunologically active portions of immunoglobulin molecules include F(ab) and F(ab')2 
fragments which can be generated by treating the antibody with an enzyme such as pepsin. 
The invention provides polyclonal and monoclonal antibodies. The term "monoclonal 
antibody" or "monoclonal antibody composition", as used herein, refers to a population of 
antibody molecules that contain only one species of an antigen binding site capable of 

30 immunoreacting with a particular epitope. 

Polyclonal antibodies can be prepared as described above by immunizing a 
suitable subject with a polypeptide of the invention as an immunogen. Preferred 
polyclonal antibody compositions are ones that have been selected for antibodies directed 
against a polypeptide or polypeptides of the invention. Particularly preferred polyclonal 
35 antibody preparations are ones that contain only antibodies directed against a polypeptide 
or polypeptides of the invention. Particularly preferred immunogen compositions are 
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those that contain no other human proteins such as, for example, immunogen 
compositions made using a non-human host cell for recombinant expression of a 
polypeptide of the invention. In such a manner, the only human epitope or epitopes 
recognized by the resulting antibody compositions raised against this immunogen will be 
present as part of a polypeptide or polypeptides of the invention. 

The antibody titer in an immunized subject can be monitored over time by standard 
techniques, such as with an enzyme linked immunosorbent assay (ELISA) using 
immobilized polypeptide. If desired, the antibody molecules can be isolated from the 
mammal (eg., from the blood) and further purified by well-known techniques, such as 
protein A chromatography to obtain the IgG fraction. Alternatively, antibodies specific 
for a protein or polypeptide of the invention can be selected for (e.g., partially purified) or 
purified by, e.g., affinity chromatography. For example, a recombinantly expressed and 
purified (or partially purified) protein of the invention is produced as described herein, and 
covalently or non-covalently coupled to a solid support such as, for example, a 
chromatography column. The column can then be used to affinity purify antibodies 
specific for the proteins of the invention from a sample containing antibodies directed 
against a large number of different epitopes, thereby generating a substantially purified 
antibody composition, i.e., one that is substantially free of contaminating antibodies. By a 
substantially purified antibody composition is meant, in this context, that the antibody 
sample contains at most only 30% (by dry weight) of contaminating antibodies directed 
against epitopes other than those on the desired protein or polypeptide of the invention, 
and preferably at most 20%, yet more preferably at most 10%, and most preferably at most 
5% (by dry weight) of the sample is contaminating antibodies. A purified antibody 
composition means that at least 99% of the antibodies in the composition are directed 
against the desired protein or polypeptide of the invention. 

At an appropriate time after immunization, e.g., when the specific antibody titers 
are highest, antibody-producing cells can be obtained from the subject and used to prepare 
monoclonal antibodies by standard techniques, such as the hybridoma technique originally 
described by Kohler and Milstein (1975) Nature 256:495-497, the human B cell 
hybridoma technique (Kozbor et al. (1983) Immunol. Today 4:72), the EBV-hybridoma 
technique (Cole et al. (1985), Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 
Inc., pp. 77-96) or trioma techniques. The technology for producing hybridomas is well 
known (see generally Current Protocols in Immunology (1994) Coligan et al. (eds.) John 
Wiley & Sons, Inc., New York, NY). Hybridoma cells producing a monoclonal antibody 
of the invention are detected by screening the hybridoma culture supematants for 
antibodies that bind the polypeptide of interest, e.g., using a standard ELISA assay. 
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Alternative to preparing monoclonal antibody-secreting hybridomas, a monoclonal 
antibody directed against a polypeptide of the invention can be identified and isolated by 
screening a recombinant combinatorial immunoglobulin library (e.g. , an antibody phage 
display library) with the polypeptide of interest. Kits for generating and screening phage 

5 display libraries are commercially available (e.g. , the Pharmacia Recombinant Phage 
Antibody System, Catalog No. 27-9400-01; and the Stratagene SurJZAP™ Phage Display 
Kit, Catalog No. 240612). Additionally, examples of methods and reagents particularly 
amenable for use in generating and screening antibody display library can be found in, for 
example, U.S. Patent No. 5,223,409; PCT Publication No. WO 92/18619; PCT 

10 Publication No. WO 91/17271; PCT Publication No. WO 92/20791; PCT Publication No. 
WO 92/15679; PCT Publication No. WO 93/01288; PCT Publication No. WO 92/01047; 
PCT Publication No. WO 92/09690; PCT Publication No. WO 90/02809; Fuchs et al. 
(1991) Bio/Technology 9:1370-1372; Hay et al. (1992) Hum. Antibod. Hybridomas 
3:81-85; Huse et al. (1989) Science 246:1275-1281; Griffiths et al. (1993) EMBOJ. 

15 12:725-734. 

Additionally, recombinant antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be 
made using standard recombinant DNA techniques, are within the scope of the invention. 
A chimeric antibody is a molecule in which different portions are derived from different 
animal species, such as those having a variable region derived from a murine mAb and a 
human immunoglobulin constant region. (See, e.g., Cabilly et al., U.S. Patent No. 
4,816,567; and Boss et al., U.S. Patent No. 4,816397, which are incorporated herein by 
reference in their entirety.) Humanized antibodies are antibody molecules from non- 
human species having one or more complementarily determining regions (CDRs) from 
the non-human species and a framework region from a human immunoglobulin molecule. 
(See, e.g., Queen, U.S. Patent No. 5,585,089, which is incorporated herein by reference in 
its entirety.) Such chimeric and humanized monoclonal antibodies can be produced by 
recombinant DNA techniques known in the art, for example using methods described in 
PCT Publication No. WO 87/02671 ; European Patent Application 1 84, 1 87; European 

}0 Patent Application 1 71 ,496; European Patent Application 1 73,494; PCT Publication No. 
WO 86/01533; U.S. Patent No. 4,816,567; European Patent Application 125,023; Better et 
al. (1988) Science 240:1041-1043; Liu et al. (1987) Proc. Natl. Acad. Sci. USA 
84:3439-3443; Liu et al. (1987) J. Immunol. 139:3521-3526; Sun et al. (1987) Proc. Natl. 
Acad. Sci. USA 84:214-21 8; Nishimura et al. (1987) Cane. Res. 47:999-1 005; Wood et al. 

* 5 (1985) Nature 314:446-449; and Shaw etal. (1988)7. Natl. Cancer Inst. 80:1553-1559); 
Morrison (1985) Science 229:1202-1207; Oi et al. (1986) Bio/Techniques 4:214; U.S. 
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Patent 5,225,539; Jones et al. (1986) Nature 321:552-525; Verhoeyan et al. (1988) Science 
239:1534; and Beidler et al. (1988) 1 Immunol 141:4053-4060. 

Completely human antibodies are particularly desirable for therapeutic treatment 
of human patients. Such antibodies can be produced, for example, using transgenic mice 

^ which are incapable of expressing endogenous immunoglobulin heavy and light chains 
genes, but which can express human heavy and light chain genes. The transgenic mice are 
immunized in the normal fashion with a selected antigen, e.g., all or a portion of a 
polypeptide of the invention. Monoclonal antibodies directed against the antigen can be 
obtained using conventional hybridoma technology. The human immunoglobulin 

10 transgenes harbored by the transgenic mice rearrange during B cell differentiation, and 
subsequently undergo class switching and somatic mutation. Thus, using such a 
technique, it is possible to produce therapeutically useful IgG, IgA and IgE antibodies. 
For an overview of this technology for producing human antibodies, see Lonberg and 
Huszar (1995, Int. Rev. Immunol. 13:65-93). For a detailed discussion of this technology 

1 5 for producing human antibodies and human monoclonal antibodies and protocols for 
producing such antibodies, see; e.g., U.S. Patent 5,625,126; U.S. Patent 5,633,425; U.S. 
Patent 5,569,825; U.S. Patent 5,661,016; and U.S. Patent 5,545,806. In addition, 
companies such as Abgenix, Inc. (Freemont, CA), can be engaged to provide human 
antibodies directed against a selected antigen using technology similar to that described 

20 above. 

Completely human antibodies which recognize a selected epitope can be generated 
using a technique referred to as "guided selection." In this approach a selected non-human 
monoclonal antibody, e.g., a mouse antibody, is used to guide the selection of a 
^ completely human antibody recognizing the same epitope. (Jespers et al. (1994) 
Bio/technology 12:899-903). 

An antibody directed against a polypeptide of the invention {e.g., monoclonal 
antibody) can be used to isolate the polypeptide by standard techniques, such as affinity 
chromatography or immunoprecipitation. Moreover, such an antibody can be used to 

30 detect the protein {e.g., in a cellular lysate or cell supernatant) in order to evaluate the 
abundance and pattern of expression of the polypeptide. The antibodies can also be used 
diagnostically to monitor protein levels in tissue as part of a clinical testing procedure, 
e.g., to, for example, determine the efficacy of a given treatment regimen. Detection can 
be facilitated by coupling the antibody to a detectable substance. Examples of detectable 

35 substances include various enzymes, prosthetic groups, fluorescent materials, luminescent 
materials, bioluminescent materials, and radioactive materials. Examples of suitable 
enzymes include horseradish peroxidase, alkaline phosphatase, beta-galactosidase, or 
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acetylcholinesterase; examples of suitable prosthetic group complexes include 
streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material 
includes luminol; examples of bioluminescent materials include luciferase, luciferin, and 
aequorin, and examples of suitable radioactive material include 125 I, 13 J I, 35 S or 3 H. 

Further, an antibody (or fragment thereof) can be conjugated to a therapeutic 
moiety such as a cytotoxin, a therapeutic agent or a radioactive metal ion, A cytotoxin or 
cytotoxic agent includes any agent that is detrimental to cells. Examples include taxol, 
cytochalasin B, gramicidin D, ethidium bromide, emetine, mitomycin, etoposide, 
tenoposide, vincristine, vinblastine, colchicin, doxorubicin, daunorubicin, dihydroxy 
anthracin dione, mitoxantrone, mithramycin, actinomycin D, 1-dehydrotestosterone, 
glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and puromycin and analogs 
or homologs thereof Therapeutic agents include, but are not limited to, antimetabolites 
{e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouraciI 
decarbazine), alkylating agents (e.g., mechlorethamine, thioepa chlorambucil, melphalan, 
carmustine (BSNU) and lomustine (CCNU), cyclothosphamide, busulfan, 
dibromomannitol, streptozotocin, mitomycin C, and cis-dichlorodiamine platinum (II) 
(DDP) cisplatin), anthracyclines (e.g., daunorubicin (formerly daunomycin) and 
doxorubicin), antibiotics (e.g., dactinomycin (formerly actinomycin), bleomycin, 
mithramycin, and anthramycin (AMC)), and anti-mitotic agents (e.g., vincristine and 
vinblastine). 

The conjugates of the invention can be used for modifying a given biological 
response, the drug moiety is not to be construed as limited to classical chemical 
therapeutic agents. For example, the drug moiety may be a protein or polypeptide 
possessing a desired biological activity. Such proteins may include, for example, a toxin 
such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a protein such as tumor 
necrosis factor, d-interferon, P-interferon, nerve growth factor, platelet derived growth 
factor, tissue plasminogen activator; or, biological response modifiers such as, for 
example, lymphokines, interleukin-1 ("IL-l"), interleukin-2 ("IL-2"), interleukin-6 
("IL-6"), granulocyte macrophase colony stimulating factor ( M GM-CSF"), granulocyte 
colony stimulating factor ( M G-CSF"), or other growth factors. 

Techniques for conjugating a therapeutic moiety to antibodies are well known, see, 
e.g., Arnon et al., "Monoclonal Antibodies For Immunotargeting Of Drugs In Cancer 
Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. (eds.), pp. 
243-56 (Alan R. Liss, Inc. 1985); Hellstrom et al., "Antibodies For Drug Delivery", in 
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Controlled Drug Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53 (Marcel Dekker, 
Inc. 1987); Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer Therapy: A 
Review", in Monoclonal Antibodies '84: Biological And Clinical Applications, Pinchera et 
al. (eds.), pp. 475-506 (1985); "Analysis, Results, And Future Prospective Of The 
5 Therapeutic Use Of Radiolabeled Antibody In Cancer Therapy", in Monoclonal 
Antibodies For Cancer Detection And Therapy, Baldwin et al. (eds.), pp. 303-16 
(Academic Press 1985), and Thorpe et al, "The Preparation And Cytotoxic Properties Of 
Antibody-Toxin Conjugates", Immunol. Rev., 62: 1 19-58 (1982). 

Alternatively, an antibody can be conjugated to a second antibody to form an 
1 0 antibody heteroconjugate as described by Segal in U.S. Patent No. 4,676,980. 

Accordingly, in one aspect, the invention provides substantially purified antibodies 
or fragment thereof, including human, non-human, chimeric, and humanized antibodies, 
which antibodies or fragments specifically bind to a polypeptide comprising an amino acid 
sequence of any one of SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 1 10, 
112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 
148, 150, 152, 154, 156, 158, 160, 162 or 164, or an amino acid sequence encoded by the 
cDNA insert of a clone deposited with the ATCC® as Accession Number 207222, 
Accession Number 207215, Accession Number 207217, Accession Number 207221, or 
patent deposit Number PTA-224, or a complement thereof. In another aspect, the 
invention provides substantially purified antibodies or fragments thereof, including 
human, non-human, chimeric and humanized antibodies, which antibodies or fragments 
thereof specifically bind to a polypeptide comprising a fragment of at least 8 contiguous 
amino acid residues, preferably at least 15 contiguous amino acid residues, of the amino 
25 acid sequence of any one of SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 
110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 
146, 148, 150, 152, 154, 156, 158, 160, 162, or 164. 

In another aspect, the invention provides substantially purified antibodies or 
fragments thereof, including human, non-human, chimeric and humanized antibodies, 

30 which antibodies or fragments thereof, which antibodies or fragments thereof specifically 
bind to a polypeptide comprising an amino acid sequence which is at least 95% identical 
to the amino acid sequence of any one of SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 58, 102, 
104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 
140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164, wherein the percent 

35 identity is determined using the ALIGN program of the GCG software package with a 
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4. In 
another aspect, the invention provides substantially purified antibodies or fragments 
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thereof, including human, non-human, chimeric and humanized antibodies, which 
antibodies or fragments thereof specifically bind to a polypeptide comprising and an 
amino acid sequence which is encoded by a nucleic acid molecule which hybridizes to the 
nucleic acid molecule consisting of any one of SEQ ID Nos:l, 2, 8, 9, 15, 16, 21, 22, 26, 

5 27,37,38,46,47,56,57,77,80,91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 
119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 
155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 
178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191or 192, orthecDNA 
insert of a clone deposited as ATCC® as Accession Number 207222, Accession Number 

1 0 2072 1 5, Accession Number 2072 1 7, Accession number 20722 1 or patent deposit Number 
PTA-224, or a complement thereof, under conditions of hybridization of 6X SSC at 45°C 
and washing in 0.2 X SSC, 0.1% SDS at 50°C, 55°C, 60°C or 65°C. 

In various embodiments, the substantially purified antibodies or fragments thereof 
of the invention are polyclonal, monoclonal, Fab fragments, single chain antibodies, or 
15 F(ab') 2 fragments. The non-human antibodies or fragments thereof of the invention can be 
goat, mouse, sheep, horse, chicken, rabbit or rat antibodies or antibodies fragments. In a 
preferred embodiment, the antibodies of the invention are monoclonal antibodies that 
specifically bind to a polypeptide of the invention. 

20 The substantially purified antibodies or fragments thereof specifically bind to a 

signal peptide, a secreted sequence, an extracellular domain, a transmembrane or a 
cytoplasmic domain cytoplasmic membrane of a polypeptide of the invention. In a 
particularly preferred embodiment, the substantially purified antibodies or fragments 
thereof of the invention specifically bind to a secreted sequence or an extracellular domain 

25 of the amino acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 
108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 
144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164, or the amino acid sequence 
encoded by the EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281 or 
EpTm281 cDNA insert of ATCC® Accession Number 207222, Accession Number 

jo 2072 1 5, Accession Number 2072 1 7, Accession Number 20722 1 , or patent deposit 

Number PTA-224, or a complement thereof. In One embodiment, the extracellular domain 
to which the antibody or antibody fragment binds comprises at least 8 contiguous amino 
acid residues, preferably at least 1 0 or at least 1 5 contiguous amino acid residues, of 
amino acid residues 30 to 206 of SEQ ID NO:28 (SEQ ID NO:76), amino acid residues 

j5 272 to 370 of SEQ ID NO:28 (SEQ ID NO:34); amino acid residues 30 to 249 of SEQ ID 
NO:39 (SEQ ID NO: 83), amino acid residues 39 to 123 of SEQ ID NO:48 (SEQ ID 
NO:50), or amino acid residues 27 to 112 of SEQ ID NO:58 (SEQ ID NO:61). 
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Any of the antibodies of the invention can be conjugated to a therapeutic moiety or 
to a detectable substance. Non-limiting examples of detectable substances that can be 
conjugated to the antibodies of the invention are an enzyme, a prosthetic group, a 
fluorescent material, a luminescent material, a bioluminescent material, and a radioactive 
material. 

The invention also provides a kit containing an antibody of the invention 
conjugated to a detectable substance, and instructions for use. Still another aspect of the 
invention is a pharmaceutical composition comprising an antibody of the invention and a 
pharmaceutically acceptable carrier. In preferred embodiments, the pharmaceutical 
composition contains an antibody of the invention, a therapeutic moiety, and a 
pharmaceutically acceptable carrier. 

Still another aspect of the invention is a method of making an antibody that 
specifically recognizes TANGO 253, TANGO 257, INTERCEPT 258, and TANGO 281, 
the method comprising immunizing a mammal with a polypeptide. In one embodiment, 
the polypeptide used as an immunogens comprises an amino acid sequence of any one of 
SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 110, 1 12, 1 14, 1 16, 118, 
120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 
156, 158, 160, 162 or 164, or an amino acid sequence encoded by the cDNA insert of a 
clone deposited with the ATCC® as Accession Number 207222, Accession Number 
207215, Accession Number 207217, Accession Number 207221, or patent deposit 
Number PTA-224. In another embodiment, the polypeptide used as an immunogen 
comprises a fragment of at least 15 amino acid residues, preferably at least 25 amino acid 
residues, of the amino acid sequence of any one of SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 
58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 
136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164, or an amino 
acid sequence which is at least 85%, preferably at least 95% identical to the amino acid 
sequence of any one of SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 1 10, 
112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 
148, 150, 152, 154, 156, 158, 160, 162 or 164, wherein the percent identity is determined 
using the ALIGN program of the GCG software package with a P AMI 20 weight residue 
table, a gap length penalty of 12, and a gap penalty of 4. 

In another embodiment, the polypeptide used as an immunogen comprises an 
amino acid sequence which is encoded by a nucleic acid molecule which hybridizes to the 
nucleic acid molecule consisting of any one of SEQ ID NOs:l, 2, 8, 9, 15, 16, 21, 22, 26, 
27, 37, 38, 46, 47, 56, 57, 77, 80,91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 
119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 
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155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 
178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, or thecDNA 
insert of a clone deposited with the ATCC® as Accession Number 207222, Accession 
Number 207215, Accession Number 207217, Accession Number 207221, or patent 

5 deposit Number PTA-224, or a complement thereof, under conditions of hybridization of 
6X SSC at 45°C and washing in 0.2 X SSC, 0.1% SDS at 50°C, 55 °C, 60°C or 65°C. 
After immunization, a sample is collected from the mammal that contains an antibody that 
specifically recognizes TANGO 253, TANGO 257, INTERCEPT 258 or TANGO 281, a 
fragment thereof, or allellic variant thereof. Preferably, the polypeptide is recombinantly 

1 0 produced using a non-human host cell. Optionally, the antibodies can be further purified 
from the sample using techniques well known to those of skill in the art. The method can 
further comprise producing a monoclonal antibody- producing cell from the cells of the 
mammal Optionally, antibodies are collected from the antibody-producing cell. 

15 

III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding a polypeptide of the invention (or a portion thereof). 
As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting 

20 another nucleic acid to which it has been linked. One type of vector is a "plasmid", which 
refers to a circular double stranded DNA loop into which additional DNA segments can be 
ligated. Another type of vector is a viral vector, wherein additional DNA segments can be 
ligated into the viral genome. Certain vectors are capable of autonomous replication in a 
host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of 

25 replication and episomal mammalian vectors). Other vectors (e.g., non-episomal 

mammalian vectors) are integrated into the genome of a host cell upon introduction into 
the host cell, and thereby are replicated along with the host genome. Moreover, certain 
vectors, expression vectors, are capable of directing the expression of genes to which they 
are operably linked. In general, expression vectors of utility in recombinant DNA 

30 techniques are often in the form of plasmids (vectors). However, the invention is intended 
to include such other forms of expression vectors, such as viral vectors (e.g., replication 
defective retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent 
functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of the 
35 invention in a form suitable for expression of the nucleic acid in a host cell. This means 
that the recombinant expression vectors include one or more regulatory sequences, 
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selected on the basis of the host cells to be used for expression, which is operably linked to 
the nucleic acid sequence to be expressed. Within a recombinant expression vector, 
"operably linked" is intended to mean that the nucleotide sequence of interest is linked to 
the regulatory sequence(s) in a manner which allows for expression of the nucleotide 

5 sequence (e.g., in an in vitro transcription/translation system or in a host cell when the 
vector is introduced into the host cell). The term "regulatory sequence" is intended to 
include promoters, enhancers and other expression control elements (e.g., polyadenylation 
signals). Such regulatory sequences are described, for example, in Goeddel, Gene 
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA 

10 (1 990). Regulatory sequences include those which direct constitutive expression of a 
nucleotide sequence in many types of host cell and those which direct expression of the 
nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences). 
It will be appreciated by those skilled in the art that the design of the expression vector can 
depend on such factors as the choice of the host cell to be transformed, the level of 

15 expression of protein desired, etc. The expression vectors of the invention can be 
introduced into host cells to thereby produce proteins or peptides, including fusion 
proteins or peptides, encoded by nucleic acids as described herein. 

The recombinant expression vectors of the invention can be designed for 
expression of a polypeptide of the invention in prokaryotic (e.g., E. coli ) or eukaiyotic 
2 ^ cells (e.g., insect cells (using baculovirus expression vectors), yeast cells or mammalian 
cells). Suitable host cells are discussed further in Goeddel, supra. Alternatively, the 
recombinant expression vector can be transcribed and translated in vitro, for example 
using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in E. coli with 
vectors containing constitutive or inducible promoters directing the expression of either 
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase expression of recombinant protein; 2) 

^ to increase the solubility of the recombinant protein; and 3) to aid in the purification of the 
recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein from 
the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and 

j5 their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and 
Johnson (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) and pRITS 
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(Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), maltose E 
binding protein, or protein A, respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 
(Amann et al., (1988) Gene 69:301-3 1 5) and pET 1 Id (Studier et al., Gene Expression 

5 Technology: Methods in Enzymology 1 85, Academic Press, San Diego, California (1990) 
60-89). Target gene expression from the pTrc vector relies on host RNA polymerase 
transcription from a hybrid trp-lac fusion promoter. Target gene expression from the pET 
lid vector relies on transcription from a T7 gnlO-lac fusion promoter mediated by a 
coexpressed viral RNA polymerase (T7 gnl). This viral polymerase is supplied by host 

10 strains BL21(DE3) or HMS174(DE3) from a resident X prophage harboring a T7 gnl gene 
under the transcriptional control of the lacUV 5 promoter. 

One strategy to maximize recombinant protein expression in E. coli is to express 
the protein in a host bacteria with an impaired capacity to proteolytically cleave the 

^ recombinant protein (Gottesmari, Gene Expression Technology: Methods in Enzymology 
185, Academic Press, San Diego, California (1990) 1 19-128). Another strategy is to alter 
the nucleic acid sequence of the nucleic acid to be inserted into an expression vector so 
that the individual codons for each amino acid are those preferentially utilized in E. coli 
(Wada et al. (1992) Nucleic Acids Res. 20:21 11-2118). Such alteration of nucleic acid 

20 sequences of the invention can be carried out by standard DNA synthesis techniques. 

In another embodiment, the expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast S. cerivisae include pYepSecl (Baldari et al. 
(1987) EMBOJ. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30:933-943), 
pJRY88 (Schultz et al. (1987) Gene 54:1 13-123), pYES2 (Invitrogen Corporation, San 
25 Diego, CA), and pPicZ (Invitrogen Corp, San Diego, CA). 

Alternatively, the expression vector is a baculovirus expression vector. 
Baculovirus vectors available for expression of proteins in cultured insect cells {e.g., Sf 9 
cells) include the pAc series (Smith et al. (1 983) Mol. Cell Biol. 3:21 56-2165) and the 
pVL series (Lucklow and Summers (1989) Virology 170:31-39). 

30 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 329:840) and pMT2PC 
(Kaufman et al. (1987) EMBOJ. 6:187-195). When used in mammalian cells, the 
expression vector's control functions are often provided by viral regulatory elements. For 
example, commonly used promoters are derived from polyoma, Adenovirus 2, 
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cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook et al., supra. 

In another embodiment, the recombinant mammalian expression vector is capable 
of directing expression of the nucleic acid preferentially in a particular cell type (e.g., 
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific 
regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al. 
(1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988) 
Adv. Immunol 43:235-275), in particular promoters of T cell receptors (Winoto and 
Baltimore (1989) EMBOJ. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell 
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters 
(e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. Natl. Acad. ScL USA 
86:5473-5477), pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916), 
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 
4,873,316 and European Application Publication No. 264,166). 

Developmentally-regulated promoters are also encompassed, for example the mouse hox 
promoters (Kessel and Gruss (1990) Science 249:374-379) and the beta-fetoprotein 
promoter (Campes and Tilghman (1989) Genes Dev. 3:537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. 
That is, the DNA molecule is operably linked to a regulatory sequence in a manner which 
allows for expression (by transcription of the DNA molecule) of an RNA molecule which 
is antisense to the mRNA encoding a polypeptide of the invention. Regulatory 
sequences operably linked to a nucleic acid cloned in the antisense orientation can be 
chosen which direct the continuous expression of the antisense RNA molecule in a variety 
of cell types, for instance viral promoters and/or enhancers, or regulatory sequences can be 
chosen which direct constitutive, tissue specific or cell type specific expression of 
antisense RNA. The antisense expression vector can be in the form of a recombinant 
plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced under 
the control of a high efficiency regulatory region, the activity of which can be determined 
by the cell type into which the vector is introduced. For a discussion of the regulation of 
gene expression using antisense genes see Weintraub et al. (Reviews - Trends in Genetics, 
Vol. 1(1) 1986). 

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such terms 
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10 



refer not only to the particular subject cell but to the progeny or potential progeny of such 
a cell. Because certain modifications may occur in succeeding generations due to either 
mutation or environmental influences, such progeny may not, in fact, be identical to the 
parent cell, but are still included within the scope of the term as used herein. 

A host cell can be any prokaryotic (e.g., E. coli) or eukaryotic cell (e.g., insect 
cells, yeast or mammalian cells). 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid into a host cell, including calcium 
phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, 
lipofection, or electroporation. Suitable methods for transforming or transfecting host 
cells can be found in Sambrook, et al. {supra), and other laboratory manuals. 

1 5 For stable transfection of mammalian cells, it is known that, depending upon the 

expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., for resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Preferred 

20 selectable markers include those which confer resistance to drugs, such as G41 8, 

hygromycin and methotrexate. Cells stably transfected with the introduced nucleic acid 
can be identified by drug selection (e.g., cells that have incorporated the selectable marker 
gene will survive, while the other cells die). 

In another embodiment, the expression characteristics of an endogenous (e.g., 
25 TANGO 253, TANGO 257, INTERCEPT 258 and TANGO 281 genes) within a cell, cell 
line or microorganism may be modified by inserting a DNA regulatory element 
heterologous to the endogenous gene of interest into the genome of a cell, stable cell line 
or cloned microorganism such that the inserted regulatory element is operatively linked 
with the endogenous gene (e.g., TANGO 253, TANGO 257, INTERCEPT 258 and 
30 TANGO 281 genes) and controls, modulates or activates. For example, endogenous 

TANGO 253, TANGO 257, INTERCEPT 258 and TANGO 281 genes which are normally 
"transcriptionally silent", i.e., a TANGO 253, TANGO 257, INTERCEPT 258 and 
TANGO 281 genes which is normally not expressed, or are expressed only at very low 
levels in a cell line or microorganism, may be activated by inserting a regulatory element 
35 which is capable of promoting the expression of a normally expressed gene product in that 
cell line or microorganism. Alternatively, transcriptionally silent, endogenous TANGO 
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253, TANGO 257, INTERCEPT 258 and TANGO 281 genes may be activated by 
insertion of a promiscuous regulatory element that works across cell types. 

A heterologous regulatory element may be inserted into a stable cell line or cloned 
microorganism, such that it is operatively linked with endogenous TANGO 253, TANGO 
5 257, INTERCEPT 258 and TANGO 281 genes, using techniques, such as targeted 
homologous recombination, which are well known to those of skill in the art, and 
described eg., in Chappel, U.S. Patent No. 5,272,071; PCT publication No. WO 
91/06667, published May 16, 1991. 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, 
can be used to produce a polypeptide of the invention. Accordingly, the invention further 
provides methods for producing a polypeptide of the invention using the host cells of the 
invention. In one embodiment, the method comprises culturing the host cell of invention 
(into which a recombinant expression vector encoding a polypeptide of the invention has 
■ been introduced) in a suitable medium such that the polypeptide is produced. In another 
embodiment, the method further comprises isolating the polypeptide from the medium or 
the host cell. 

The host cells of the invention can also be used to produce nonhuman transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized 

20 oocyte or an embryonic stem cell into which a sequence encoding a polypeptide of the 
invention has been introduced. Such host cells can then be used to create non-human 
transgenic animals in which exogenous sequences encoding a polypeptide of the invention 
have been introduced into their genome or homologous recombinant animals in which 
endogenous encoding a polypeptide of the invention sequences have been altered. Such 

25 animals are useful for studying the function and/or activity of the polypeptide and for 
identifying and/or evaluating modulators of polypeptide activity. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a 
rodent such as a rat or mouse, in which one or more of the cells of the animal includes a 
transgene. Other examples of transgenic animals include non-human primates, sheep, 

30 dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA which is 
integrated into the genome of a cell from which a transgenic animal develops and which 
remains in the genome of the mature animal, thereby directing the expression of an 
encoded gene product in one or more cell types or tissues of the transgenic animal. As 
used herein, an "homologous recombinant animal" is a non-human animal, preferably a 

35 mammal, more preferably a mouse, in which an endogenous gene has been altered by 
homologous recombination between the endogenous gene and an exogenous DNA 
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molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior 
to development of the animal. 

A transgenic animal of the invention can be created by introducing nucleic acid 
encoding a polypeptide of the invention (or a homologue thereof) into the male pronuclei 
of a fertilized oocyte, e.g., by microinjection, retroviral infection, and allowing the oocyte 
to develop in a pseudopregnant female foster animal. Intronic sequences and 
polyadenylation signals can also be included in the transgene to increase the efficiency of 
expression of the transgene. A tissue-specific regulatory sequence(s) can be operably 
linked to the transgene to direct expression of the polypeptide of the invention to particular 
cells. Methods for generating transgenic animals via embryo manipulation and 
microinjection, particularly animals such as mice, have become conventional in the art and 
are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S. Patent No. 
4,873,191 and in Hogan, Manipulating the Mouse Embryo, (Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1986) and Wakayama et al, (1999), Proc. 
Natl. Acad. Sci. USA, 96:14984-14989. Similar methods are used for production of other 
transgenic animals. A transgenic founder animal can be identified based upon the 
presence of the transgene in its genome and/or expression ofmRNA encoding the 
transgene in tissues or cells of the animals. A transgenic founder animal can then be used 
to breed additional animals carrying the transgene. Moreover, transgenic animals carrying 
the transgene can further be bred to other transgenic animals carrying other transgenes. 

To create an homologous recombinant animal, a vector is prepared which contains 
at least a portion of a gene encoding a polypeptide of the invention into which a deletion, 
addition or substitution has been introduced to thereby alter, e.g, functionally disrupt, the 
gene. In a preferred embodiment, the vector is designed such that, upon homologous 
recombination, the endogenous gene is functionally disrupted (i.e., no longer encodes a 
functional protein; also referred to as a "knock out" vector). Alternatively, the vector can 
be designed such that, upon homologous recombination, the endogenous gene is mutated 
or otherwise altered but still encodes functional protein {e.g., the upstream regulatory 
region can be altered to thereby alter the expression of the endogenous protein). In the 
homologous recombination vector, the altered portion of the gene is flanked at its 5' and 3' 
ends by additional nucleic acid of the gene to allow for homologous recombination to 
occur between the exogenous gene carried by the vector and an endogenous gene in an 
embryonic stem cell. The additional flanking nucleic acid sequences are of sufficient 
length for successful homologous recombination with the endogenous gene. Typically, 
several kilobases of flanking DNA (both at the 5' and 3' ends) are included in the vector 
{see, e.g., Thomas and Capecchi (1987) Cell 51 :503 for a description of homologous 
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recombination vectors). The vector is introduced into an embryonic stem cell line (e.g., by 
electroporation) and cells in which the introduced gene has homologously recombined 
with the endogenous gene are selected (see, e.g., Li et al. (1992) Cell 69:915). The 
selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to form 

5 aggregation chimeras (see, e.g., Bradley in Teratocarcinomas and Embryonic Stem Cells: 
A Practical Approach, Robertson, ed. (IRL, Oxford, 1987) pp. 1 13-152). A chimeric 
embryo can then be implanted into a suitable pseudopregnant female foster animal and the 
embryo brought to term. Progeny harboring the homologously recombined DNA in their 
germ cells can be used to breed animals in which all cells of the animal contain the 

10 homologously recombined DNA by germline transmission of the transgene. Methods for 
constructing homologous recombination vectors and homologous recombinant animals are 
described further in Bradley (1991) Current Opinion in Bio/Technology 2:823-829 and in 
PCT Publication Nos. WO 90/1 1354, WO 91/01 140, WO 92/0968, and WO 93/04169. 

In another embodiment, transgenic non-human animals can be produced which 
1 5 contain selected systems which allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PI. For a 
description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) Proa Natl. 
Acad Sci. USA 89:6232-6236. Another example of a recombinase system is the FLP 
recombinase system of Saccharomyces cerevisiae (O'Gorman et al. (1991) Science 
251:1351-1355. If a cre/loxP recombinase system is used to regulate expression of the 
transgene, animals containing transgenes encoding both the Cre recombinase and a 
selected protein are required. Such animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 
^ recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut et al. (1997) Nature 385:810-813 and PCT 
Publication NOS. WO 97/07668 and WO 97/07669. 

JO 

IV. Pharmaceutical Compositions 

The nucleic acid molecules, polypeptides, and antibodies (also referred to herein as 
"active compounds") of the invention can be incorporated into pharmaceutical 
compositions suitable for administration. Such compositions typically comprise the 
nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable carrier. As 
used herein the language "pharmaceutically acceptable carrier" is intended to include any 



- 103- 



WO 00/78808 



PCT/US00/16883 



and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic 
and absorption delaying agents, and the like, compatible with pharmaceutical 
administration. The use of such media and agents for pharmaceutically active substances 
is well known in the art. Except insofar as any conventional media or agent is 
5 incompatible with the active compound, use thereof in the compositions is contemplated. 
Supplementary active compounds can also be incorporated into the compositions. 

The invention includes methods for preparing pharmaceutical compositions for 
modulating the expression or activity of a polypeptide or nucleic acid of the invention. 
Such methods comprise formulating a pharmaceutically acceptable carrier with an agent 

1 0 which modulates expression or activity of a polypeptide or nucleic acid of the invention. 
Such compositions can further include additional active agents. Thus, the invention 
further includes methods for preparing a pharmaceutical composition by formulating a 
pharmaceutically acceptable carrier with an agent which modulates expression or activity 
of a polypeptide or nucleic acid of the invention and one or more additional active 

15 compounds. 

A pharmaceutical composition of the invention is formulated to be compatible with 
its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal 
(topical), transmucosal, and rectal administration. Solutions or suspensions used for 
parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial 
agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or 
^ sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid; buffers such as 
acetates, citrates or phosphates and agents for the adjustment of tonicity such as sodium 
chloride or dextrose. pH can be adjusted with acids or bases, such as hydrochloric acid or 
sodium hydroxide. The parenteral preparation can be enclosed in ampoules, disposable 
syringes or multiple dose vials made of glass or plastic. 

30 Pharmaceutical compositions suitable for injectable use include sterile aqueous 

solutions (where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersions. For intravenous administration, 
suitable carriers include physiological saline, bacteriostatic water, Cremophor EL™ 
(BASF; Parsippany, NJ) or phosphate buffered saline (PBS). In all cases, the composition 

35 must be sterile and should be fluid to the extent that easy syringability exists. It must be 
stable under the conditions of manufacture and storage and must be preserved against the 
contaminating action of microorganisms such as bacteria and fungi. The carrier can be a 
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solvent or dispersion medium containing, for example, water, ethanol, polyol (for 
example, glycerol, propylene glycol, and liquid polyetheylene glycol, and the like), and 
suitable mixtures thereof. The proper fluidity can be maintained, for example, by the use 
of a coating such as lecithin, by the maintenance of the required particle size in the case of 

5 dispersion and by the use of surfactants. Prevention of the action of microorganisms can 
be achieved by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be 
preferable to include isotonic agents, for example, sugars, polyalcohols such as mannitol, 
sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable 

10 compositions can be brought about by including in the composition an agent which delays 
absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound 
(e.g., a polypeptide or antibody) in the required amount in an appropriate solvent with one 
or a combination of ingredients enumerated above, as required, followed by filtered 

1 5 sterilization. Generally, dispersions are prepared by incorporating the active compound 
into a sterile vehicle which contains a basic dispersion medium and the required other 
ingredients from those enumerated above. In the case of sterile powders for the 
preparation of sterile injectable solutions, the preferred methods of preparation are vacuum 
drying and freeze-drying which yields a powder of the active ingredient plus any 

20 additional desired ingredient from a previously sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They can 
be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
25 used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. 

Pharmaceutically compatible binding agents, and/or adjuvant materials can be 
included as part of the composition. The tablets, pills, capsules, troches and the like can 

30 contain any of the following ingredients, or compounds of a similar nature: a binder such 
as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant 
such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 

35 methyl salicylate, or orange flavoring. 
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For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from a pressurized container or dispenser which contains a suitable 
propellant, e.g. , a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
5 transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
10 compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art. 

The compounds can also be prepared in the form of suppositories {e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
^ enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled release 
formulation, including implants and microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, 

20 polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation 
of such formulations will be apparent to those skilled in the art. The materials can also be 
obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal 
suspensions (including liposomes targeted to infected cells with monoclonal antibodies to 
viral antigens) can also be used as pharmaceutically acceptable carriers. These can be 

25 prepared according to methods known to those skilled in the art, for example, as described 
in U.S. Patent No. 4,522,81 1 . 

It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject to 

30 be treated; each unit containing a predetermined quantity of active compound calculated to 
produce the desired therapeutic effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of the invention are dictated by and 
directly dependent on the unique characteristics of the active compound and the particular 
therapeutic effect to be achieved, and the limitations inherent in the art of compounding 

35 such an active compound for the treatment of individuals. 
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For antibodies, the preferred dosage is 0.1 mg/kg to 100 mg/kg of body weight 
(generally 10 mg/kg to 20 mg/kg). If the antibody is to act in the brain, a dosage of 50 
mg/kg to 100 mg/kg is usually appropriate. Generally, partially human antibodies and 
fully human antibodies have a longer half-life within the human body than other 
5 antibodies. Accordingly, lower dosages and less frequent administration is often possible. 
Modifications such as lipidation can be used to stabilize antibodies and to enhance uptake 
and tissue penetration (e.g., into the brain). A method for lipidation of antibodies is 
described by Cruikshank et al. ((1997) 1 Acquired Immune Deficiency Syndromes and 
Human Retrovirology 14:193). 

*0 As defined herein, a therapeutically effective amount of protein or polypeptide 

(i.e., an effective dosage) ranges from about 0.001 to 30 mg/kg body weight, preferably 
about 0.01 to 25 mg/kg body weight, more preferably about 0.1 to 20 mg/kg body weight, 
and even more preferably about 1 to 10 mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 4 to 7 mg/kg, 
or 5 to 6 mg/kg body weight. 

The skilled artisan will appreciate that certain factors may influence the dosage 
required to effectively treat a subject, including but not limited to the severity of the 
disease or disorder, previous treatments, the general health and/or age of the subject, and 
other diseases present. Moreover, treatment of a subject with a therapeutically effective 
amount of a protein, polypeptide, or antibody can include a single treatment or, preferably, 
can include a series of treatments. In a preferred example, a subject is treated with 
antibody, protein, or polypeptide in the range of between about 0.1 to 20 mg/kg body 
weight, one time per week for between about 1 to 10 weeks, preferably between 2 to 8 
weeks, more preferably between about 3 to 7 weeks, and even more preferably for about 4, 
5, or 6 weeks. It will also be appreciated that the effective dosage of antibody, protein, or 
polypeptide used for treatment may increase or decrease over the course of a particular 
treatment. Changes in dosage may result and become apparent from the results of 
diagnostic assays as described herein. 

The present invention encompasses agents which modulate expression or activity. 

30 An agent may, for example, be a small molecule. For example, such small molecules 
include, but are not limited to, peptides, peptidomimetics, amino acids, amino acid 
analogs, polynucleotides, polynucleotide analogs, nucleotides, nucleotide analogs, organic 
or inorganic compounds (z.e,. including heteroorganic and organometallic compounds) 
having a molecular weight less than about 10,000 grams per mole, organic or inorganic 

35 compounds having a molecular weight less than about 5,000 grams per mole, organic or 
inorganic compounds having a molecular weight less than about 1,000 grams per mole, 
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organic or inorganic compounds having a molecular weight less than about 500 grams per 
mole, and salts, esters, and other pharmaceutical^ acceptable forms of such compounds. 

It is understood that appropriate doses of small molecule agents depends upon a 
number of factors within the ken of the ordinarily skilled physician, veterinarian, or 
researcher. The dose(s) of the small molecule will vary, for example, depending upon the 
identity, size, and condition of the subject or sample being treated, further depending upon 
the route by which the composition is to be administered, if applicable, and the effect 
which the practitioner desires the small molecule to have upon the nucleic acid or 
polypeptide of the invention. Exemplary doses include milligram or microgram amounts 
of the small molecule per kilogram of subject or sample weight {e.g., about 1 microgram 
per kilogram to about 500 milligrams per kilogram, about 100 micrograms per kilogram 
to about 5 milligrams per kilogram, or about 1 microgram per kilogram to about 50 
micrograms per kilogram. It is furthermore understood that appropriate doses of a small 
molecule depend upon the potency of the small molecule with respect to the expression or 
activity to be modulated. Such appropriate doses may be determined using the assays 
described herein. When one or more of these small molecules is to be administered to an 
animal {e.g., a human) in order to modulate expression or activity of a polypeptide or 
nucleic acid of the invention, a physician, veterinarian, or researcher may, for example, 
prescribe a relatively low dose at first, subsequently increasing the dose until an 
appropriate response is obtained. In addition, it is understood that the specific dose level 
for any particular animal subject will depend upon a variety of factors including the 
activity of the specific compound employed, the age, body weight, general health, gender, 
and diet of the subject, the time of administration, the route of administration, the rate of 
excretion, any drug combination, and the degree of expression or activity to be modulated. 

The nucleic acid molecules of the invention can be inserted into vectors and used 
as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (U.S. Patent 5,328,470) or by 
stereotactic injection {see, e.g., Chen et al. (1994) Proc. Natl. Acad. Set USA 
91 :3054-3057). The pharmaceutical preparation of the gene therapy vector can include the 
gene therapy vector in an acceptable diluent, or can comprise a slow release matrix in 
which the gene delivery vehicle is imbedded. Alternatively, where the complete gene 
delivery vector can be produced intact from recombinant cells, e.g., retroviral vectors, the 
pharmaceutical preparation can include one or more cells which produce the gene delivery 
system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 
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V. Uses and Methods of t he Invention 

The nucleic acid molecules, proteins, protein homologues, and antibodies 
described herein can be used in one or more of the following methods: a) screening assays; 
b) detection assays (e.g., chromosomal mapping, tissue typing, forensic biology); c) 

5 predictive medicine (e.g., diagnostic assays, prognostic assays, monitoring clinical trials, 
and pharmacogenomics); and d) methods of treatment (e.g., therapeutic and prophylactic). 
The isolated nucleic acid molecules of the invention can be used to express proteins (e.g., 
via a recombinant expression vector in a host cell in gene therapy applications), to detect 
mRNA (e.g., in a biological sample) or a genetic lesion, and to modulate activity of a 

10 polypeptide of the invention. In addition, the polypeptides of the invention can be used to 
screen drugs or compounds which modulate activity or expression of a polypeptide of the 
invention as well as to treat disorders characterized by insufficient or excessive production 
of a protein of the invention or production of a form of a protein of the invention which 
has decreased or aberrant activity compared to the wild type protein. In addition, the 

1 5 antibodies of the invention can be used to detect and isolate a protein of the and modulate 
activity of a protein of the invention. 

This invention further pertains to novel agents identified by the 
above-described screening assays and uses thereof for treatments as described herein. 

20 

A. Screening Assays 

The invention provides a method (also referred to herein as a "screening assay") 
for identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides, 
peptidomimetics, small molecules or other drugs) which bind to polypeptide of the 
invention or have a stimulatory or inhibitory effect on, for example, expression or activity 
of a polypeptide of the invention. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of the membrane-bound form of a 

30 polypeptide of the invention or biologically active portion thereof. The test compounds of 
the present invention can be obtained using any of the numerous approaches in 
combinatorial library methods known in the art, including: biological libraries; spatially 
addressable parallel solid phase or solution phase libraries; synthetic library methods 
requiring deconvolution; the "one-bead one-compound" library method; and synthetic 

35 library methods using affinity chromatography selection. The biological library approach 
is limited to peptide libraries, while the other four approaches are applicable to peptide, 
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non-peptide oligomer or small molecule libraries of compounds (Lam (1997) Anticancer 
DrugDes. 12:145). 

Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt et al. (1993) Proc. Natl Acad. Set USA 90:6909; Erb et al. 
5 (1994) Proc. Natl Acad. Sci. USA 91:1 1422; Zuckermann et al. (1994). J. Med. Chern. 
37:2678; Cho et al. (1993) Science 261:1303; Carrell et al. (1994) Angew. Chem. Int. Ed. 
Engl 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl 33:2061; and Gallop et al. 
(1994) J. Med. Chem. 37:1233. 

Libraries of compounds may be presented in solution {e.g., Houghten (1992) 
Bio/Techniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor 
(1993) Nature 364:555-556), bacteria (U.S. Patent No. 5,223,409), spores (Patent NOS. 
5,571,698; 5,403,484; and 5,223,409), plasmids (Cull et al. (1992) Proc. Natl Acad. Set 
USA 89:1865-1869) or phage (Scott and Smith (1990) Science 249:386-390; Devlin 
(1990) Science 249:404-406; Cwirla et al. (1990) Proc. Natl Acad. Sci. USA 
87:6378-6382; andFelici (1991) J. Mol. Biol. 222:301-310). 

In one embodiment, an assay is a cell-based assay in which a cell which expresses 
a membrane-bound form of a polypeptide of the invention, or a biologically active portion 
thereof, on the cell surface is contacted with a test compound and the ability of the test 

20 compound to bind to the polypeptide determined. The cell, for example, can be a yeast 
cell or a cell of mammalian origin. Determining the ability of the test compound to bind 
to the polypeptide can be accomplished, for example, by coupling the test compound with 
a radioisotope or enzymatic label such that binding of the test compound to the 
polypeptide or biologically active portion thereof can be determined by detecting the 

25 labeled compound in a complex. For example, test compounds can be labeled with *25j f 
35 S, 14 C, or 3 H, either directly or indirectly, and the radioisotope detected by direct 
counting of radioemmission or by scintillation counting. Alternatively, test compounds 
can be enzymatically labeled with, for example, horseradish peroxidase, alkaline 
phosphatase, or luciferase, and the enzymatic label detected by determination of 

30 conversion of an appropriate substrate to product. In a preferred embodiment, the assay 
comprises contacting a cell which expresses a membrane-bound form of a polypeptide of 
the invention, or a biologically active portion thereof, on the cell surface with a known 
compound which binds the polypeptide to form an assay mixture, contacting the assay 
mixture with a test compound, and determining the ability of the test compound to interact 

35 with the polypeptide, wherein determining the ability of the test compound to interact with 
the polypeptide comprises determining the ability of the test compound to preferentially 
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bind to the polypeptide or a biologically active portion thereof as compared to the known 
compound. 

In another embodiment, an assay is a cell-based assay comprising contacting a cell 
expressing a membrane-bound form of a polypeptide of the invention, or a biologically 

5 active portion thereof, on the cell surface with a test compound and determining the ability 
of the test compound to modulate {e.g., stimulate or inhibit) the activity of the polypeptide 
or biologically active portion thereof. Determining the ability of the test compound to 
modulate the activity of the polypeptide or a biologically active portion thereof can be 
accomplished, for example, by determining the ability of the polypeptide protein to bind to 

10 or interact with a target molecule. 

Determining the ability of a polypeptide of the invention to bind to or interact with 
a target molecule can be accomplished by one of the methods described above for 
determining direct binding. As used herein, a "target molecule" is a molecule with which 
a selected polypeptide (e.g., a polypeptide of the invention) binds or interacts with in 
nature, for example, a molecule on the surface of a cell which expresses the selected 
protein, a molecule on the surface of a second cell, a molecule in the extracellular milieu, a 
molecule associated with the internal surface of a cell membrane or a cytoplasmic 
molecule. A target molecule can be a polypeptide of the invention or some other 

20 polypeptide or protein. For example, a target molecule can be a component of a signal 
transduction pathway which facilitates transduction of an extracellular signal (e.g., a signal 
generated by binding of a compound to a polypeptide of the invention) through the cell 
membrane and into the cell or a second intercellular protein which has catalytic activity or 
a protein which facilitates the association of downstream signaling molecules with a 
polypeptide of the invention. Determining the ability of a polypeptide of the invention to 
bind to or interact with a target molecule can be accomplished by determining the activity 
of the target molecule. For example, the activity of the target molecule can be determined 
by detecting induction of a cellular second messenger of the target (e.g.> intracellular 
Ca 2+ , diacylglycerol, IP3, etc.), detecting catalytic/enzymatic activity of the target on an 

^ appropriate substrate, detecting the induction of a reporter gene (e.g., a regulatory element 
that is responsive to a polypeptide of the invention operably linked to a nucleic acid 
encoding a detectable marker, e.g., luciferase), or detecting a cellular response, for 
example, cellular differentiation, or cell proliferation. 

In yet another embodiment, an assay of the present invention is a cell-free assay 
35 comprising contacting a polypeptide of the invention or biologically active portion thereof 
with a test compound and determining the ability of the test compound to bind to the 
polypeptide or biologically active portion thereof. Binding of the test compound to the 
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polypeptide can be determined either directly or indirectly as described above. In a 
preferred embodiment, the assay includes contacting the polypeptide of the invention or 
biologically active portion thereof with a known compound which binds the polypeptide 
to form an assay mixture, contacting the assay mixture with a test compound, and 
5 determining the ability of the test compound to interact with the polypeptide, wherein 
determining the ability of the test compound to interact with the polypeptide comprises 
determining the ability of the test compound to preferentially bind to the polypeptide or 
biologically active portion thereof as compared to the known compound. 

In another embodiment, an assay is a cell-free assay comprising contacting a 
10 polypeptide of the invention or biologically active portion thereof with a test compound 
and determining the ability of the test compound to modulate (e.g., stimulate or inhibit) 
the activity of the polypeptide or biologically active portion thereof Determining the 
ability of the test compound to modulate the activity of the polypeptide can be 
accomplished, for example, by determining the ability of the polypeptide to bind to a 
15 target molecule by one of the methods described above for determining direct binding. In 
an alternative embodiment, determining the ability of the test compound to modulate the 
activity of the polypeptide can be accomplished by determining the ability of the 
polypeptide of the invention to further modulate the target molecule. For example, the 
catalytic/enzymatic activity of the target molecule on an appropriate substrate can be 
20 determined as previously described. 

In yet another embodiment, the cell-free assay comprises contacting a polypeptide 
of the invention or biologically active portion thereof with a known compound which 
binds the polypeptide to form an assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test compound to interact with the 
polypeptide, wherein determining the ability of the test compound to interact with the 
polypeptide comprises determining the ability of the polypeptide to preferentially bind to 
or modulate the activity of a target molecule. 

The cell-free assays of the present invention are amenable to use of both a soluble 
30 form or the membrane-bound form of a polypeptide of the invention. In the case of 
cell-free assays comprising the membrane-bound form of the polypeptide, it may be 
desirable to utilize a solubilizing agent such that the membrane-bound form of the 
polypeptide is maintained in solution. Examples of such solubilizing agents include 
non-ionic detergents such as n-octylglucoside, n-dodecylglucoside, n-octylmaltoside, 
3 5 octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, Triton X- 1 00, Triton X- 1 1 4, 
Thesit, Isotridecypoly(ethylene glycol ether)n, 

3-[(3-cholamidopropyl)dimethylamminio]-l -propane sulfonate (CHAPS), 
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3-[(3-cholamidopropyl)dimethylamminio]-2-hydroxy-l -propane sulfonate (CHAPSO), or 
N-dodecyl=N,N-dimethyl-3-ammonio- 1 -propane sulfonate. 

In more than one embodiment of the above assay methods of the present invention, 
it may be desirable to immobilize either the polypeptide of the invention or its target 
molecule to facilitate separation of complexed from uncomplexed forms of one or both of 
the proteins, as well as to accommodate automation of the assay. Binding of a test 
compound to the polypeptide, or interaction of the polypeptide with a target molecule in 
the presence and absence of a candidate compound, can be accomplished in any vessel 
suitable for containing the reactants. Examples of such vessels include microtitre plates, 
test tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein can be 
provided which adds a domain that allows one or both of the proteins to be bound to a 
matrix. For example, glutathione-S-transferase fusion proteins or 
glutathione-S-transferase fusion proteins can be adsorbed onto glutathione sepharose 
beads (Sigma Chemical; St. Louis, MO) or glutathione derivatized microtitre plates, which 
are then combined with the test compound or the test compound and either the 
non-adsorbed target protein or A polypeptide of the invention, and the mixture incubated 
under conditions conducive to complex formation {e.g., at physiological conditions for salt 
and pH). Following incubation, the beads or microtitre plate wells are washed to remove 
any unbound components and complex formation is measured either directly or indirectly, 
for example, as described above. Alternatively, the complexes can be dissociated from the 
matrix, and the level of binding or activity of the polypeptide of the invention can be 
determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either the polypeptide of the invention or 
its target molecule can be immobilized utilizing conjugation of biotin and streptavidin. 
Biotinylated polypeptide of the invention or target molecules can be prepared from 
biotin-NHS (N-hydroxy-succinimide) using techniques well known in the art (e.g., 
biotinylation kit, Pierce Chemicals; Rockford, IL), and immobilized in the wells of 
streptavidin-coated 96 welt plates (Pierce Chemical). Alternatively, antibodies reactive 
with the polypeptide of the invention or target molecules but which do not interfere with 
binding of the polypeptide of the invention to its target molecule can be derivatized to the 
wells of the plate, and unbound target or polypeptide of the invention trapped in the wells 
by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of 
complexes using antibodies reactive with the polypeptide of the invention or target 
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molecule, as well as enzyme-linked assays which rely on detecting an enzymatic activity 
associated with the polypeptide of the invention or target molecule. 

In another embodiment, modulators of expression of a polypeptide of the invention 
are identified in a method in which a cell is contacted with a candidate compound and the 
expression of the selected mRNA or protein (i.e.. the mRNA or protein corresponding to a 
polypeptide or nucleic acid of the invention) in the cell is determined. The level of 
expression of the selected mRNA or protein in the presence of the candidate compound is 
compared to the level of expression of the selected mRNA or protein in the absence of the 
candidate compound. The candidate compound can then be identified as a modulator of 
expression of the polypeptide of the invention based on this comparison. For example, 
when expression of the selected mRNA or protein is greater (statistically significantly 
greater) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as a stimulator of the selected mRNA or protein expression. 
Alternatively, when expression of the selected mRNA or protein is less (statistically 
significantly less) in the presence of the candidate compound than in its absence, the 
candidate compound is identified as an inhibitor of the selected mRNA or protein 
expression. The level of the selected mRNA or protein expression in the cells can be 
determined by methods described herein. 

In yet another aspect of the invention, a polypeptide of the inventions can be used 
as "bait proteins" in a two-hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 
5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993)/. Biol. Chem. 
268:12046-12054; Bartel et al. (1993) Bio/Techniques 14:920-924; Iwabuchi et al. (1993) 
Oncogene 8:1693-1696; and PCT Publication No. WO 94/10300), to identify other 
proteins, which bind to or interact with the polypeptide of the invention and modulate 
activity of the polypeptide of the invention. Such binding proteins are also likely to be 
involved in the propagation of signals by the polypeptide of the inventions as, for 
example, upstream or downstream elements of a signaling pathway involving the 
polypeptide of the invention. 

This invention further pertains to novel agents identified by the above-described 
screening assays and uses thereof for treatments as described herein. 

B. Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the 
corresponding complete gene sequences) can be used in numerous ways as polynucleotide 
reagents. For example, these sequences can be used to: (i) map their respective genes on a 
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chromosome and, thus, locate gene regions associated with genetic disease; (ii) identify an 
individual from a minute biological sample (tissue typing); and (iii) aid in forensic 
identification of a biological sample. These applications are described in the subsections 
below. 

5 

L Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
sequence can be used to map the location of the gene on a chromosome. Accordingly, 
10 nucleic acid molecules described herein or fragments thereof, can be used to map the 
location of the corresponding genes on a chromosome. The mapping of the sequences to 
chromosomes is an important first step in correlating these sequences with genes 
associated with disease. 

Briefly, genes can be mapped to chromosomes by preparing PCR primers 
15 (preferably 15-25 bp in length) from the sequence of a gene of the invention. Computer 
analysis of the sequence of a gene of the invention can be used to rapidly select primers 
that do not span more than one exon in the genomic DNA, thus complicating the 
amplification process. These primers can then be used for PCR screening of somatic cell 
hybrids containing individual human chromosomes. Only those hybrids containing the 
20 human gene corresponding to the gene sequences will yield an amplified fragment. For a 
review of this technique, see D'Eustachio et al. ((1983) Science 220:919-924). 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a 
particular sequence to a particular chromosome. Three or more sequences can be assigned 
per day using a single thermal cycler. Using the nucleic acid sequences of the invention to 
design oligonucleotide primers, sublocalization can be achieved with panels of fragments 
from specific chromosomes. Other mapping strategies which can similarly be used to map 
a gene to its chromosome include in situ hybridization (described in Fan et al. (1990) 
Proc. Natl Acad. Sci. USA 87:6223-27), pre-screening with labeled flow-sorted 
chromosomes (CITE), and pre-selection by hybridization to chromosome specific cDNA 

30 

libraries. Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can further be used to provide a precise chromosomal location in one 
step. For a review of this technique, see Verma et al., (Human Chromosomes: A Manual 
of Basic Techniques (Pergamon Press, New York, 1988)). 

35 Reagents for chromosome mapping can be used individually to mark a single 

chromosome or a single site on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. Reagents corresponding to 
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noncoding regions of the genes actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved within gene families, thus increasing the chance 
of cross hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the physical 
5 position of the sequence on the chromosome can be correlated with genetic map data. 
(Such data are found, for example, in V. McKusick, Mendelian Inheritance in Man, 
available on-line through Johns Hopkins University Welch Medical Library). The 
relationship between genes and disease, mapped to the same chromosomal region, can 
then be identified through linkage analysis (co-inheritance of physically adjacent genes), 
1 0 described in, e.g., Egeland et al. (1987) Nature 325:783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with a gene of the invention can be determined. If a 
mutation is observed in some or all of the affected individuals but not in any unaffected 
individuals, then the mutation is likely to be the causative agent of the particular disease. 
Comparison of affected and unaffected individuals generally involves first looking for 
structural alterations in the chromosomes such as deletions or translocations that are 
visible from chromosome spreads or detectable using PCR based on that DNA sequence. 
Ultimately, complete sequencing of genes from several individuals can be performed to 
20 confirm the P resence °f a mutation and to distinguish mutations from polymorphisms. 

Furthermore, the nucleic acid sequences disclosed herein can be used to perform 
searches against <4 mapping databases", e.g., BLAST-type search, such that the 
chromosome position of the gene is identified by sequence homology or identity with 
known sequence fragments which have been mapped to chromosomes. 

25 in the instant case, the human gene for INTERCEPT 258 has been mapped to the 

long arm of chromosome 11, in the region q23. Flanking markers for this region are 
Dl 1S936 and Dl 1S933. The CMT4B (Charcot Marie Tooth neuropathy), ED4 
(ecotodermal dysplasia), JBS (Jacobsen Syndrome), TCPT (thrombocytopenia) loci also 
map to this region of the human chromosome. The APOLP 1 (apoplipoprotein cluster), 

30 DRD2 (dopamine receptor), and RDX (radixin) genes also map to this region of the 
human chromosome. This region is syntenic to mouse chromosome 9. The atm (ataxia 
telangiectasia), ruf (rough fur), and vs (variable spotting) loci map to this region of the 
mouse chromosome. The lu (luxoid), vs (variable spotting), atm (ataxia telangiectasia), 
rug (rough fur), and lapl (leucine arylarninopeptidase) genes also map to this region of the 

35 mouse chromosome. 
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A polypeptide and fragments and sequences thereof and antibodies specific thereto 
can be used to map the location of the gene encoding the polypeptide on a chromosome. 
This mapping can be carried out by specifically detecting the presence of the polypeptide 
in members of a panel of somatic cell hybrids between cells of a first species of animal 

5 from which the protein originates and cells from a second species of animal and then 
determining which somatic cell hybrid(s) expresses the polypeptide and noting the 
chromosome(s) from the first species of animal that it contains. For examples of this 
technique, see Pajunen et al. (1988) Cytogenet. Cell Genet 47:37-41 and Van Keuren et 
al (1986; Hum. Genet. 74:34-40. Alternatively, the presence of the polypeptide in the 

10 somatic cell hybrids can be determined by assaying an activity or property of the 

polypeptide, for example, enzymatic activity, as described in Bordelon-Riser et al. (1979; 
Somatic Cell Genetics 5:597-613 and Owerbach et al (1978; Proc. Natl. Acad. Sci. USA 
75:5640-5644. 



2. Tissue Typing 

The nucleic acid sequences of the present invention can also be used to identify 
individuals from minute biological samples. The United States military, for example, is 
considering the use of restriction fragment length polymorphism (RFLP) for identification 

20 of its personnel. In this technique, an individual's genomic DNA is digested with one or 
more restriction enzymes, and probed on a Southern blot to yield unique bands for 
identification. This method does not suffer from the current limitations of "Dog Tags" 
which can be lost, switched, or stolen, making positive identification difficult. The 
sequences of the present invention are useful as additional DNA markers for RFLP 

25 (described in U.S. Patent 5,272,057). 

Furthermore, the sequences of the present invention can be used to provide an 
alternative technique which determines the actual base-by-base DNA sequence of selected 
portions of an individual's genome. Thus, the nucleic acid sequences described herein can 
be used to prepare two PCR primers from the 5 f and 3* ends of the sequences. These 
30 primers can then be used to amplify an individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from individuals, prepared in this 
manner, can provide unique individual identifications, as each individual will have a 
unique set of such DNA sequences due to allelic differences. The sequences of the present 
invention can be used to obtain such identification sequences from individuals and from 
tissue. The nucleic acid sequences of the invention uniquely represent portions of the 
human genome. Allelic variation occurs to some degree in the coding regions of these 
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sequences, and to a greater degree in the noncoding regions. It is estimated that allelic 
variation between individual humans occurs with a frequency at about once per each 500 
bases. Each of the sequences described herein can, to some degree, be used as a standard 
against which DNA from an individual can be compared for identification purposes. 

5 Because greater numbers of polymorphisms occur in the noncoding regions, fewer 

sequences are necessary to differentiate individuals. The noncoding sequences of SEQ ID 
NO:l, 8, 15, 21, 26, 37, 46 or 56, can comfortably provide positive individual 
identification with a panel of perhaps 10 to 1,000 primers which each yield a noncoding 
amplified sequence of 100 bases. If predicted coding sequences, such as those in SEQ ID 

10 NO:2, 9, 16, 22, 27, 38, 47 or 57 are used, a more appropriate number of primers for 
positive individual identification would be 500-2,000. 

If a panel of reagents from the nucleic acid sequences described herein is used to 
generate a unique identification database for an individual, those same reagents can later 
be used to identify tissue from that individual. Using the unique identification database, 
1 5 positive identification of the individual, living or dead, can be made from extremely small 
tissue samples. 



3. Use of Partial Gene Sequences in Forensic Biology 

20 DNA-based identification techniques can also be used in forensic biology. 

Forensic biology is a scientific field employing genetic typing of biological evidence 
found at a crime scene as a means for positively identifying, for example, a perpetrator of 
a crime. To make such an identification, PCR technology can be used to amplify DNA 
sequences taken from very small biological samples such as tissues, e.g., hair or skin, or 

25 body fluids, e.g., blood, saliva, or semen found at a crime scene. The amplified sequence 
can then be compared to a standard, thereby allowing identification of the origin of the 
biological sample. 

The sequences of the present invention can be used to provide polynucleotide 
reagents, e.g., PCR primers, targeted to specific loci in the human genome, which can 

30 

enhance the reliability of DNA-based forensic identifications by, for example, providing 
another "identification marker" (i.e. another DNA sequence that is unique to a particular 
individual). As mentioned above, actual base sequence information can be used for 
identification as an accurate alternative to patterns formed by restriction enzyme generated 
fragments. Sequences targeted to noncoding regions are particularly appropriate for this 

35 

use as greater numbers of polymorphisms occur in the noncoding regions, making it easier 
to differentiate individuals using this technique. Examples of polynucleotide reagents 
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include the nucleic acid sequences of the invention or portions thereof, e.g., fragments 
derived from noncoding regions having a length of at least 20 or 30 bases. 

The nucleic acid sequences described herein can further be used to provide 
polynucleotide reagents, e.g., labeled or labelable probes which can be used in, for 
5 example, an in situ hybridization technique, to identify a specific tissue, e.g., brain tissue. 
This can be very useful in cases where a forensic pathologist is presented with a tissue of 
unknown origin. Panels of such probes can be used to identify tissue by species and/or by 
organ type. 

10 

C. Predictive Medicine 

The present invention also pertains to the field of predictive medicine in which 
diagnostic assays, prognostic assays, and monitoring clinical trials are used for prognostic 
(predictive) purposes to thereby treat an individual prophylactically. Accordingly, one 

1 5 aspect of the present invention relates to diagnostic assays for determining TANGO 253, 
TANGO 257, INTERCEPT 258, or TANGO 281 protein and/or nucleic acid expression as 
well as TANGO 253, TANGO 257, INTERCEPT 258, or TANGO 281 activity, in the 
context of a biological sample (e.g., blood, serum, cells, tissue) to thereby determine 
whether an individual is afflicted with a disease or disorder, or is at risk of developing a 

20 disorder, associated with aberrant or unwanted TANGO 253, TANGO 257, INTERCEPT 
258, or TANGO 281 expression or activity. The invention also provides for prognostic (or 
predictive) assays for determining whether an individual is at risk of developing a disorder 
associated with TANGO 253, TANGO 257, INTERCEPT 258, or TANGO 281 protein, 
nucleic acid expression or activity. For example, mutations in a TANGO 253, TANGO 

25 257, INTERCEPT 258, or TANGO 281 gene can be assayed in a biological sample. Such 
assays can be used for prognostic or predictive purpose to thereby prophylactically treat an 
individual prior to the onset of a disorder characterized by or associated with TANGO 
253, TANGO 257, INTERCEPT 258, or TANGO 281 protein, nucleic acid expression or 
activity. 

30 

As an alternative to making determinations based on the absolute expression level 
of selected genes, determinations may be based on the normalized expression levels of 
these genes. Expression levels are normalized by correcting the absolute expression level 
of a TANGO 253, TANGO 257, INTERCEPT 258, or TANGO 281 gene by comparing its 
expression to the expression of a gene that is not a TANGO 253, TANGO 257, 
INTERCEPT 258, or TANGO 281 gene, e.g., a housekeeping gene that is constitutively 
expressed. Suitable genes for normalization include housekeeping genes such as the actin 
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gene. This normalization allows the comparison of the expression level in one sample, 
e.g., a patient sample, to another sample, e.g., a sample from an individual without a 
particular disease or disorder, or a sample from a healthy individual, or between samples 
from different sources. 

* Alternatively, the expression level can be provided as a relative expression level. 

To determine a relative expression level of a gene, the level of expression of the gene is 
determined for 10 or more samples of different cell isolates (e.g., neural cell isolates, glial 
cell isolates, immune cell isolates, platelet isolates, megakaryocyte isolates, endothelial 
cell isolates, and osteocyte isolates) preferably 50 or more samples, prior to the 

I® determination of the expression level for the sample in question. The mean expression 
level of each of the genes assayed in the larger number of samples is determined and this 
is used as a baseline expression level for the gene(s) in question. The expression level of 
the gene determined for the test sample (absolute level of expression) is then divided by 
the mean expression value obtained for that gene. This provides a relative expression 

1 5 level and aids in identifying extreme cases of diseases and disorders such as obesity, 
coronary disorders (e.g., atherosclerosis), neuronal disorders, pulmonary disorders, renal 
disorders, and bleeding disorders. 

Preferably, the samples used in the baseline determination will be from diseased or 
from non-diseased cells of the appropriate cell type or tissue. The choice of the cell source 
is dependent on the use of the relative expression level. Using expression found in normal 
tissues as a mean expression score aids in validating whether the TANGO 253, TANGO 
257, INTERCEPT 258, or TANGO 281 gene assayed is specific (versus normal cells). 
Such a use is particularly important in identifying whether a TANGO 253, TANGO 257, 
INTERCEPT 258, or TANGO 281 gene can serve as a target gene. In addition, as more 
data is accumulated, the mean expression value can be revised, providing improved 
relative expression values based on accumulated data. Expression data from cells provides 
a means for grading the severity of the disease or disorder state. 

Another aspect of the invention pertains to monitoring the influence of agents (e.g., 
30 drugs, compounds) on the expression or activity of TANGO 253, TANGO 257, 

INTERCEPT 258, or TANGO 281 in clinical trials. These and other agents are described 
in further detail in the following sections. 

1. Diagnostic Assays 

35 

An exemplary method for detecting the presence or absence of a polypeptide or 
nucleic acid of the invention in a biological sample involves obtaining a biological sample 
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from a test subject and contacting the biological sample with a compound or an agent 
capable of detecting a polypeptide or nucleic acid {e.g., mRNA, genomic DNA) of the 
invention such that the presence of a polypeptide or nucleic acid of the invention is 
detected in the biological sample. A preferred agent for detecting mRNA or genomic 

5 DNA encoding a polypeptide of the invention is a labeled nucleic acid probe capable of 
hybridizing to mRNA or genomic DNA encoding a polypeptide of the invention. The 
nucleic acid probe can be, for example, a full-length cDNA, such as the nucleic acid of 
SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56 or 57, or a portion thereof, 
such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and 

1 0 sufficient to specifically hybridize under stringent conditions to a mRNA or genomic 
DNA encoding a polypeptide of the invention. Other suitable probes for use in the 
diagnostic assays of the invention are described herein. 

A preferred agent for detecting a polypeptide of the invention is an antibody 
capable of binding to a polypeptide of the invention, preferably an antibody with a 

1 5 detectable label. Antibodies can be polyclonal, or more preferably, monoclonal. An intact 
antibody, or a fragment thereof {e.g., Fab or F(ab')2> can be used - Th © term "labeled", 
with regard to the probe or antibody, is intended to encompass direct labeling of the probe 
or antibody by coupling {i.e., physically linking) a detectable substance to the probe or 
antibody, as well as indirect labeling of the probe or antibody by reactivity with another 

20 reagent that is directly labeled. Examples of indirect labeling include detection of a 

primary antibody using a fluorescently labeled secondary antibody and end-labeling of a 
DNA probe with biotin such that it can be detected with fluorescently labeled streptavidin. 
The term "biological sample" is intended to include tissues, cells and biological fluids 
isolated from a subject, as well as tissues, cells and fluids present within a subject. That 

25 is, the detection method of the invention can be used to detect mRNA, protein, or genomic 
DNA in a biological sample in vitro as well as in vivo. For example, in vitro techniques 
for detection of mRNA include Northern hybridizations and in situ hybridizations. In 
vitro techniques for detection of a polypeptide of the invention include enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations and 

*0 immunofluorescence. In vitro techniques for detection of genomic DNA include Southern 
hybridizations. Furthermore, in vivo techniques for detection of a polypeptide of the 
invention include introducing into a subject a labeled antibody directed against the 
polypeptide. For example, the antibody can be labeled with a radioactive marker whose 
presence and location in a subject can be detected by standard imaging techniques. 

\5 

In one embodiment, the biological sample contains protein molecules from the test 
subject. Alternatively, the biological sample can contain mRNA molecules from the test 
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subject or genomic DNA molecules from the test subject. A preferred biological sample is 
a peripheral blood leukocyte sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control biological 
sample from a control subject, contacting the control sample with a compound or agent 

5 capable of detecting a polypeptide of the invention or mRNA or genomic DNA encoding a 
polypeptide of the invention, such that the presence of the polypeptide or mRNA or 
genomic DNA encoding the polypeptide is detected in the biological sample, and 
comparing the presence of the polypeptide or mRNA or genomic DNA encoding the 
polypeptide in the control sample with the presence of the polypeptide or mRNA or 

1 0 genomic DNA encoding the polypeptide in the test sample. 

The invention also encompasses kits for detecting the presence of a polypeptide or 
nucleic acid of the invention in a biological sample (a test sample). Such kits can be used 
to determine if a subject is suffering from or is at increased risk of developing a disorder 
j associated with aberrant expression of a polypeptide of the invention, as discussed, for 
example, in sections above relating to uses of the sequences of the invention. 

For example, kits can be used to determine if a subject is suffering from or is at 
increased risk of disorders such as coronary disorders (e.g., heart diseases and disorders 
such as atherosclerosis., coronary artery disease and plaque formation), and adipocyte- 

20 related disorders (e.g., obesity), which are associated with aberrant TANGO 253 

expression. In another example, kits can be used to determine if a subject is suffering 
from or is at increased risk of disorders such as coronary disorders (e.g., heart diseases and 
disorders such as atherosclerosis, coronary artery disease and plague formation), olfactory 
disorders, neurological disorders (e.g., neurodegenerative disorders, neuromuscular 

25 disorders, cognitive disorders, personality disorders, and motor disorder) and pulmonary 
disorders, (e.g., cystic fibrosis), which are associated with aberrant TANGO 257 
expression. In another example, kits can be used to determine if a subject is suffering from 
or is at increased risk of disorders such as Type I immunologic disorders, (e.g., 
anaphylaxis and rhinitis), which are associated with aberrant INTERCEPT 258 expression. 

30 In another example, kits can be used to determine if a subject is suffering from or is at 
increased risk of disorders such as immunological disorders, (e.g. thrombocytopenia and 
platelet disorders), developmental disorders, coronary disorders, e.g., ischemic heart 
disease or atherosclerosis, neurological disorders, (e.g., head trauma and brain cancer), 
pulmonary disorders, (e.g., lung cancer, cystic fibrosis and rheumatoid lung disease), 

35 kidney disorders, (e.g., glomerulonephritis and end stage renal disease), autoimmune 
disorders, (e.g., Crohn's disease) and embryonic disorders, which are associated with 
aberrant TANGO 281 expression. The kit, for example, can comprise a labeled compound 
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or agent capable of detecting the polypeptide or mRNA encoding the polypeptide in a 
biological sample and means for determining the amount of the polypeptide or mRNA in 
the sample (e.g., an antibody which binds the polypeptide or an oligonucleotide probe 
which binds to DNA or mRNA encoding the polypeptide). Kits can also include 
instructions for observing that the tested subject is suffering from or is at risk of 
developing a disorder associated with aberrant expression of the polypeptide if the amount 
of the polypeptide or mRNA encoding the polypeptide is above or below a normal level. 

For antibody-based kits, the kit can comprise, for example: (1) a first antibody 
(e.g., attached to a solid support) which binds to a polypeptide of the invention; and, 
optionally, (2) a second, different antibody which binds to either the polypeptide or the 
first antibody and is conjugated to a detectable agent. 

For oligonucleotide-based kits, the kit can comprise, for example: (1) an 
oligonucleotide, e.g., a detectably labeled oligonucleotide, which hybridizes to a nucleic 
acid sequence encoding a polypeptide of the invention or (2) a pair of primers useful for 
amplifying a nucleic acid molecule encoding a polypeptide of the invention. The kit can 
also comprise, e.g., a buffering agent, a preservative, or a protein stabilizing agent. The 
kit can also comprise components necessary for detecting the detectable agent (e.g., an 
enzyme or a substrate). The kit can also contain a control sample or a series of control 
samples which can be assayed and compared to the test sample contained. Each 
component of the kit is usually enclosed within an individual container and all of the 
various containers are within a single package along with instructions for observing 
whether the tested subject is suffering from or is at risk of developing a disorder 
associated with aberrant expression of the polypeptide. 

2. Prognostic Assays 

The methods described herein can furthermore be utilized as diagnostic or 
prognostic assays to identify subjects having or at risk of developing a disease or disorder 
associated with aberrant expression or activity of a polypeptide of the invention. For 
example, the assays described herein, such as the preceding diagnostic assays or the 
following assays, can be utilized to identify a subject having or at risk of developing a 
disorder associated with aberrant expression or activity of a polypeptide of the invention, 
e.g., coronary disorders, pulmonary disorders, kidney disorders or embryonic disorders. 
Alternatively, the prognostic assays can be utilized to identify a subject having or at risk 
for developing such a disease or disorder. Thus, the present invention provides a method 
in which a test sample is obtained from a subject and a polypeptide or nucleic acid (e.g., 
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mRNA, genomic DNA) of the invention is detected, wherein the presence of the 
polypeptide or nucleic acid is diagnostic for a subject having or at risk of developing a 
disease or disorder associated with aberrant expression or activity of the polypeptide. As 
used herein, a "test sample" refers to a biological sample obtained from a subject of 
interest. For example, a test sample can be a biological fluid {e.g., serum), cell sample, or 
tissue. 

The prognostic assays described herein, for example, can be used to identify a 
subject having or at risk of developing disorders such as disorders discussed, for example, 
in Sections above relating to uses of the sequences of the invention; 

For example, such disorders can include coronary disorders (e.g., heart diseases 
and disorders such as atherosclerosis, coronary artery disease and plague formation) and 
adipocyte disorders (eg., obesity), which are associated with aberrant TANGO 253 
expression. In another example, prognostic assays described herein, can be used to 
identify a subject having or at risk of developing disorders such as coronary disorders 
(e.g., heart diseases and disorders such as atherosclerosis, coronary artery disease and 
plague formation), olfactory disorders, neurological disorders (e.g., neurodegenerate 
disorders, neuromuscular disorders, cognitive disorders, personality disorders, and motor 
disorders), and pulmonary disorders, (e.g., cystic fibrosis), which are associated with 
aberrant TANGO 257 expression. In another example, prognostic assays described herein, 
can be used to identify a subject having or at risk of developing disorders such as Type I 
immunologic disorders, (e.g., anaphylaxis and rhinitis), which are associated with aberrant 
INTERCEPT 258 expression. In another example, prognostic assays described herein, for 
example, can be used to identify a subject having or at risk of developing disorders such as 
immunological disorders, (e.g. thrombocytopenia and platelet disorders), developmental 
disorders, coronary disorders, (e.g., ischemic heart disease and atherosclerosis), 
neurological disorders, (e.g., head trauma and brain cancer), pulmonary disorders, (e.g., 
lung cancer, cystic fibrosis and rheumatoid lung disease), kidney disorders, (e.g., 
glomerulonephritis and end stage renal disease), autoimmune disorders, (e.g., Crohn's 
disease) and embryonic disorders, which are associated with aberrant TANGO 281 
expression. 

Furthermore, the prognostic assays described herein can be used to determine 
whether a subject can be administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) to 
35 treat a disease or disorder associated with aberrant expression or activity of a polypeptide 
of the invention. For example, such methods can be used to determine whether a subject 
can be effectively treated with a specific agent or class of agents (e.g., agents of a type 
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which decrease activity of the polypeptide). Thus, the present invention provides methods 
for determining whether a subject can be effectively treated with an agent for a disorder 
associated with aberrant expression or activity of a polypeptide of the invention in which a 
test sample is obtained and the polypeptide or nucleic acid encoding the polypeptide is 
detected (e.g., wherein the presence of the polypeptide or nucleic acid is diagnostic for a 
subject that can be administered the agent to treat a disorder associated with aberrant 
expression or activity of the polypeptide). 

The methods of the invention can also be used to detect genetic lesions or 
mutations in a gene of the invention, thereby determining if a subject with the lesioned 
gene is at risk for a disorder characterized aberrant expression or activity of a polypeptide 
of the invention. In preferred embodiments, the methods include detecting, in a sample of 
cells from the subject, the presence or absence of a genetic lesion or mutation 
characterized by at least one of an alteration affecting the integrity of a gene encoding the 
polypeptide of the invention, or the mis-expression of the gene encoding the polypeptide 
of the invention. For example, such genetic lesions or mutations can be detected by 
ascertaining the existence of at least one of: 1) a deletion of one or more nucleotides from 
the gene; 2) an addition of one or more nucleotides to the gene; 3) a substitution of one or 
more nucleotides of the gene; 4) a chromosomal rearrangement of the gene; 5) an 
alteration in the level of a messenger RNA transcript of the gene; 6) an aberrant 
20 modification of the gene, such as of the methylation pattern of the genomic DNA; 7) the 
presence of a non-wild type splicing pattern of a messenger RNA transcript of the gene; 8) 
a non-wild type level of a the protein encoded by the gene; 9) an allelic loss of the gene; 
and 10) an inappropriate post-translational modification of the protein encoded by the 
gene. As described herein, there are a large number of assay techniques known in the art 
25 which can be used for detecting lesions in a gene. 

In certain embodiments, detection of the lesion involves the use of a probe/primer 
in a polymerase chain reaction (PGR) {see, e.g., U.S. Patent Nos. 4,683,195 and 
4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain 
reaction (LCR) {see, e.g., Landegran et al. (1988) Science 241 : 1077- 1080; and Nakazawa 
et al. (1994) Proc. Natl. Acad. Sci. USA 91 :360-364), the latter of which can be 
particularly useful for detecting point mutations in a gene {see, e.g., Abravaya et al. (1995) 
Nucleic Acids Res. 23:675-682). This method can include the steps of collecting a sample 
of cells from a patient, isolating nucleic acid {e.g., genomic, mRNA or both) from the cells 
of the sample, contacting the nucleic acid sample with one or more primers which 
specifically hybridize to the selected gene under conditions such that hybridization and 
amplification of the gene (if present) occurs, and detecting the presence or absence of an 



30 



35 



-125- 



WO 00/78808 



PCT/US00/16883 



10 



15 



amplification product, or detecting the size of the amplification product and comparing the 
length to a control sample. It is anticipated that PCR and/or LCR may be desirable to use 
as a preliminary amplification step in conjunction with any of the techniques used for 
detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication 
(Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh, et al. (1989) Proc. Natl. Acad. Sci. USA 86:1 173-1 177), 
Q-Beta Replicase (Lizardi et al. (1988) Bio/Technology 6:1197), or any other nucleic acid 
amplification method, followed by the detection of the amplified molecules using 
techniques well known to those of skill in the art. These detection schemes are especially 
useful for the detection of nucleic acid molecules if such molecules are present in very low 
numbers. 

In an alternative embodiment, mutations in a selected gene from a sample cell can 
be identified by alterations in restriction enzyme cleavage patterns. For example, sample 
and control DNA is isolated, amplified (optionally), digested with one or more restriction 
endonucleases, and fragment length sizes are determined by gel electrophoresis and 
compared. Differences in fragment length sizes between sample and control DNA 
indicates mutations in the sample DNA. Moreover, the use of sequence specific 
ribozymes (see, e.g., U.S. Patent No. 5,498,531) can be used to score for the presence of 
specific mutations by development or loss of a ribozyme cleavage site. 

In other embodiments, genetic mutations can be identified by hybridizing a sample 
and control nucleic acids, e.g., DNA or RNA, to high density arrays containing hundreds 
or thousands of oligonucleotides probes (Cronin et al. (1996) Human Mutation 7:244-255; 

25 Kozal et al. (1 996) Nature Medicine 2:753-759). For example, genetic mutations can be 
identified in two-dimensional arrays containing light-generated DNA probes as described 
in Cronin et al., supra. Briefly, a first hybridization array of probes can be used to scan 
through long stretches of DNA in a sample and control to identify base changes between 
the sequences by making linear arrays of sequential overlapping probes. This step allows 

30 the identification of point mutations. This step is followed by a second hybridization array 
that allows the characterization of specific mutations by using smaller, specialized probe 
arrays complementary to all variants or mutations detected. Each mutation array is 
composed of parallel probe sets, one complementary to the wild-type gene and the other 
complementary to the mutant gene. 

35 In yet another embodiment, any of a variety of sequencing reactions known in the 

art can be used to directly sequence the selected gene and detect mutations by comparing 
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the sequence of the sample nucleic acids with the corresponding wild-type (control) 
sequence. Examples of sequencing reactions include those based on techniques developed 
by Maxim and Gilbert ((1977) Proc. Natl. Acad. Sci. USA 74:560) or Sanger ((1977) 
Proc. Natl. Acad. ScL USA 74:5463). It is also contemplated that any of a variety of 
5 automated sequencing procedures can be utilized when performing the diagnostic assays 
((1995) Bio/Techniques 19:448), including sequencing by mass spectrometry ( see, e.g., 
PCT Publication No. WO 94/16101; Cohen et al. (1996) Adv. Chromatogr. 36:127-162; 
and Griffin et al. (1993) Appl. Biochem. Biotechnol. 38:147-159). 

Other methods for detecting mutations in a selected gene include methods in which 
10 protection from cleavage agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA heteroduplexes (Myers et al. (1985) Science 230:1242). In general, the 
technique of mismatch cleavage entails providing heteroduplexes formed by hybridizing 
(labeled) RNA or DNA containing the wild-type sequence with potentially mutant RNA or 
DNA obtained from a tissue sample. The double-stranded duplexes are treated with an 
1 5 agent which cleaves single-stranded regions of the duplex such as which will exist due to 
basepair mismatches between the control and sample strands. RNA/DNA duplexes can be 
treated with RNase to digest mismatched regions, and DNA/DNA hybrids can be treated 
with SI nuclease to digest mismatched regions. 

In other embodiments, either DNA/DNA or RNA/DNA duplexes can be treated 
with hydroxylamine or osmium tetroxide and with piperidine in order to digest 
mismatched regions. After digestion of the mismatched regions, the resulting material is 
then separated by size on denaturing polyacrylamide gels to determine the site of 
mutation. See, e.g., Cotton et al. (1988) Proc. Natl. Acad. Set. USA 85:4397; Saleeba et al. 
25 (1992) Methods Enzymol. 217:286-295. In a preferred embodiment, the control DNA or 
RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or more 
proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA 
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations 

30 in cDNAs obtained from samples of cells. For example, the mutY enzyme of E. coli 
cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells 
cleaves T at G/T mismatches (Hsu et al. (1994) Carcinogenesis 15:1657-1662). 
According to an exemplary embodiment, a probe based on a selected sequence, e.g., a 
wild-type sequence, is hybridized to a cDNA or other DNA product from a test cell(s). 

35 The duplex is treated with a DNA mismatch repair enzyme, and the cleavage products, if 
any, can be detected from electrophoresis protocols or the like. See, e.g., U.S. Patent No. 
5,459,039. 
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In other embodiments, alterations in electrophoretic mobility will be used to 
identify mutations in genes. For example, single strand conformation polymorphism 
(SSCP) may be used to detect differences in electrophoretic mobility between mutant and 
wild type nucleic acids (Orita et al. (1989) Proc. Natl. Acad. Sci. USA 86:2766; see also 

5 Cotton (1993) Mutat. Res. 285:125-144; Hayashi (1992) Genet. Anal. Tech. Appl 

9:73-79). Single-stranded DNA fragments of sample and control nucleic acids will be 
denatured and allowed to renature. The secondary structure of single-stranded nucleic 
acids varies according to sequence, and the resulting alteration in electrophoretic mobility 
enables the detection of even a single base change. The DNA fragments may be labeled or 

1 0 detected with labeled probes. The sensitivity of the assay may be enhanced by using RNA 
(rather than DNA), in which the secondary structure is more sensitive to a change in 
sequence. In a preferred embodiment, the subject method utilizes heteroduplex analysis to 
separate double stranded heteroduplex molecules on the basis of changes in 
electrophoretic mobility (Keen et al. (1991) Trends Genet. 7:5). 

1 5 In yet another embodiment, the movement of mutant or wild-type fragments in 

polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE) (Myers et al. (1985) Nature 313:495). When DGGE 
is used as the method of analysis, DNA will be modified to insure that it does not 
completely denature, for example by adding a GC clamp of approximately 40 bp of 

20 high-melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient is 
used in place of a denaturing gradient to identify differences in the mobility of control and 
sample DNA (Rosenbaum and Reissner (1987) Biophys. Chem. 265:12753). 

Examples of other techniques for detecting point mutations include, but are not 
limited to, selective oligonucleotide hybridization, selective amplification, or selective 
primer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
which permit hybridization only if a perfect match is found (Saiki et al. (1986) Nature 
324:163); Saiki et al. (1989) Proc. Natl Acad. Sci. USA 86:6230). Such allele specific 
3o oligonucleotides are hybridized to PCR amplified target DNA or a number of different 
mutations when the oligonucleotides are attached to the hybridizing membrane and 
hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology which depends on selective 
PCR amplification may be used in conjunction with the instant invention. 
35 Oligonucleotides used as primers for specific amplification may carry the mutation of 
interest in the center of the molecule (so that amplification depends on differential 
hybridization) (Gibbs et al. (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme 3' 
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end of one primer where, under appropriate conditions, mismatch can prevent or reduce 
polymerase extension (Prossner (1993) Tibtech 1 1 :238). In addition, it may be desirable 
to introduce a novel restriction site in the region of the mutation to create cleavage-based 
detection (Gasparini et al. (1992) MoL Cell Probes 6:1). It is anticipated that in certain 
embodiments amplification may also be performed using Taq ligase for amplification 
(Barany (1991) Proc. Natl Acad. Sci. USA 88:189). In such cases, ligation will occur 
only if there is a perfect match at the 3* end of the 5' sequence making it possible to detect 
the presence of a known mutation at a specific site by looking for the presence or absence 
of amplification. 

The methods described herein may be performed, for example, by utilizing 
pre-packaged diagnostic kits comprising at least one probe nucleic acid or antibody 
reagent described herein, which may be conveniently used, e.g., in clinical settings to 
diagnose patients exhibiting symptoms or family history of a disease or illness involving a 
gene encoding a polypeptide of the invention. Furthermore, any cell type or tissue, e.g., 
chondrocytes, in which the polypeptide of the invention is expressed may be utilized in the 
prognostic assays described herein. 

3. Pharmacogenomics 

Agents, or modulators which have a stimulatory or inhibitory effect on activity or 
expression of a polypeptide of the invention as identified by a screening assay described 
herein can be administered to individuals to treat (prophylactically or therapeutically) 
disorders associated with aberrant activity of the polypeptide. In conjunction with such 
treatment, the pharmacogenomics (i.e., the study of the relationship between an 
25 individual's genotype and that individual's response to a foreign compound or drug) of the 
individual may be considered. Differences in metabolism of therapeutics can lead to 
severe toxicity or therapeutic failure by altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, the pharmacogenomics of the 
individual permits the selection of effective agents (e.g., drugs) for prophylactic or 
30 therapeutic treatments based on a consideration of the individual's genotype. Such 

pharmacogenomics can further be used to determine appropriate dosages and therapeutic 
regimens. Accordingly, the activity of a polypeptide of the invention, expression of a 
nucleic acid of the invention, or mutation content of a gene of the invention in an 
individual can be determined to thereby select appropriate agent(s) for therapeutic or 
35 prophylactic treatment of the individual. 
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Pharmacogenetics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected persons. 
See, e.g., Linder (1997) Clin. Chem. 43(2):254-266. In general, two types of 
pharmacogenetic conditions can be differentiated. Genetic conditions transmitted as a 

5 single factor altering the way drugs act on the body are referred to as "altered drug action." 
Genetic conditions transmitted as single factors altering the way the body acts on drugs are 
referred to as "altered drug metabolism". These pharmacogenetic conditions can occur 
either as rare defects or as polymorphisms. For example, glucose-6-phosphate 
dehydrogenase deficiency (G6PD) is a common inherited enzymopathy in which the main 

1 0 clinical complication is haemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 

15 2) and cytochrome P450 enzymes CYP2D6 and CYP2C 1 9) has provided an explanation 
as to why some patients do not obtain the expected drug effects or show exaggerated drug 
response and serious toxicity after taking the standard and safe dose of a drug. These 
polymorphisms are expressed in two phenotypes in the population, the extensive 
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among 

20 different populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence of 
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently 
experience exaggerated drug response and side effects when they receive standard doses. 
If a metabolite is the active therapeutic moiety, a PM will show no therapeutic response, 

25 as demonstrated for the analgesic effect of codeine mediated by its CYP2D6-formed 

metabolite morphine. The other extreme are the so called ultra-rapid metabolizers who do 
not respond to standard doses. Recently, the molecular basis of ultra-rapid metabolism 
has been identified to be due to CYP2D6 gene amplification. 

30 Thus, the activity of a polypeptide of the invention, expression of a nucleic acid 

encoding the polypeptide, or mutation content of a gene encoding the polypeptide in an 
individual can be determined to thereby select appropriate agent(s) for therapeutic or 
prophylactic treatment of the individual. In addition, pharmacogenetic studies can be used 
to apply genotyping of polymorphic alleles encoding drug-metabolizing enzymes to the 

^ identification of an individual's drug responsiveness phenotype. This knowledge, when 
applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure and 
thus enhance therapeutic or prophylactic efficiency when treating a subject with a 
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modulator of activity or expression of the polypeptide, such as a modulator identified by 
one of the exemplary screening assays described herein. 

4. Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression or 
activity of a polypeptide of the invention (e.g., the ability to modulate aberrant cell 
proliferation chemotaxis, and/or differentiation) can be applied not only in basic drug 
screening, but also in clinical trials. For example, the effectiveness of an agent, as 

1 o determined by a screening assay as described herein, to increase gene expression, protein 
levels or protein activity, can be monitored in clinical trials of subjects exhibiting 
decreased gene expression, protein levels, or protein activity. Alternatively, the 
effectiveness of an agent, as determined by a screening assay, to decrease gene expression, 
protein levels or protein activity, can be monitored in clinical trials of subjects exhibiting 

!5 increased gene expression, protein levels, or protein activity. In such clinical trials, 
expression or activity of a polypeptide of the invention and preferably, that of other 
polypeptide that have been implicated in for example, a cellular proliferation disorder, can 
be used as a marker of the immune responsiveness of a particular cell. 

For example, and not by way of limitation, genes, including those of the invention, 
20 that are modulated in cells by treatment with an agent (e.g., compound, drug or small 
molecule) which modulates activity or expression of a polypeptide of the invention (e.g., 
as identified in a screening assay described herein) can be identified. Thus, to study the 
effect of agents on cellular proliferation disorders, for example, in a clinical trial, cells can 
be isolated and RNA prepared and analyzed for the levels of expression of a gene of the 
25 invention and other genes implicated in the disorder. The levels of gene expression (i.e., a 
gene expression pattern) can be quantified by Northern blot analysis or RT-PCR, as 
described herein, or alternatively by measuring the amount of protein produced, by one of 
the methods as described herein, or by measuring the levels of activity of a gene of the 
invention or other genes. In this way, the gene expression pattern can serve as a marker, 
30 indicative of the physiological response of the cells to the agent. Accordingly, this 
response state may be determined before, and at various points during, treatment of the 
individual with the agent. 

In a preferred embodiment, the present invention provides a method for monitoring 
the effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, 

35 

peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate 
identified by the screening assays described herein) comprising the steps of (i) obtaining a 
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pre-administration sample from a subject prior to administration of the agent; (ii) detecting 
the level of the polypeptide or nucleic acid of the invention in the preadministration 
sample; (iii) obtaining one or more post-administration samples from the subject; (iv) 
detecting the level the of the polypeptide or nucleic acid of the invention in the 

5 post-administration samples; (v) comparing the level of the polypeptide or nucleic acid of 
the invention in the pre-administration sample with the level of the polypeptide or nucleic 
acid of the invention in the post-administration sample or samples; and (vi) altering the 
administration of the agent to the subject accordingly. For example, increased 
administration of the agent may be desirable to increase the expression or activity of the 

10 polypeptide to higher levels than detected, i.e., to increase the effectiveness of the agent. 
Alternatively, decreased administration of the agent may be desirable to decrease 
expression or activity of the polypeptide to lower levels than detected, i.e., to decrease the 
effectiveness of the agent. 

15 

C. Methods of Treatment 

The present invention provides for both prophylactic and therapeutic methods of 
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated 
with aberrant expression or activity of a polypeptide of the invention, as discussed, for 
20 example, in sections above relating to uses of the sequences of the invention. 

For example, disorders characterized by aberrant expression or activity of the 
polypeptides of the invention include immunologic disorders, coronary disorders, 
pulmonary disorders, neurological disorders, kidney disorders, and autoimmune disorders. 
The nucleic acids, polypeptides, and modulators thereof of the invention can be used to 

25 treat immunologic diseases and disorders, including but not limited to, allergic disorders 
(e.g., anaphylaxis and allergic asthma) autoimmune and inflammatory disorders (e.g., 
atopic dermatitis). Polypeptides of the invention can be used to treat diseases associated 
with bacterial infection (e.g., tuberculosis, e.g., pulmonary tuberculosis), inflammatory 
arthropathy, and bone and cartilage degenerative diseases and disorders (e.g., arthritis, 

30 e.g., rheumatoid arthritis). Polypeptides of the invention can be used to treat pulmonary 
disorders such as lung cancer, cystic fibrosis and rheumatoid lung diseases. Polypeptides 
of the invention can be used to treat coronary disorders, such as ischemic heart disease, 
atherosclerosis and plague formation. Polypeptides of the invention can also be used to 
treat neurological disorders such as neurodegenerate disorders, neuromuscular disorders 

35 and cognitive disorders. Polypeptides of the invention can also be used to treat kidney 
disorders such as glomerulonephritis and end stage renal disease. Further, polypeptides of 
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the invention can be used to treat autoimmune disorders such as Crohns disease, and other 
disorders described herein. 

1. Prophylactic Methods 

In one aspect, the invention provides a method for preventing in a subject, a 
disease or condition associated with an aberrant expression or activity of a polypeptide of 
the invention, by administering to the subject an agent which modulates expression or at 
least one activity of the polypeptide. Subjects at risk for a disease which is caused or 

1 0 contributed to by aberrant expression or activity of a polypeptide of the invention can be 
identified by, for example, any or a combination of diagnostic or prognostic assays as 
described herein. Administration of a prophylactic agent can occur prior to the 
manifestation of symptoms characteristic of the aberrancy, such that a disease or disorder 
is prevented or, alternatively, delayed in its progression. Depending on the type of 

1 5 aberrancy, for example, an agonist or antagonist agent can be used for treating the subject. 
For example, an antagonist of a TANGO 253, TANGO 257, INTERCEPT 258 or TANGO 
281 proteins may be used to treat an immunologic disorder, e.g., rheumatoid arthritis. The 
appropriate agent can be determined based on screening assays described herein. 

20 

* 2. Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating expression or 
activity of a polypeptide of the invention for therapeutic purposes. The modulatory 
method of the invention involves contacting a cell with an agent that modulates one or 

25 more of the activities of the polypeptide. An agent that modulates activity can be an agent 
as described herein, such as a nucleic acid or a protein, a naturally-occurring cognate 
ligand of the polypeptide, a peptide, a peptidomimetic, or other small molecule. In one 
embodiment, the agent stimulates one or more of the biological activities of the 
polypeptide. Examples of such stimulatory agents include the active polypeptide of the 

3Q invention and a nucleic acid molecule encoding the polypeptide of the invention that has 
been introduced into the cell. In another embodiment, the agent inhibits one or more of 
the biological activities of the polypeptide of the invention. Examples of such inhibitory 
agents include antisense nucleic acid molecules and antibodies. These modulatory 
methods can be performed in vitro {e.g., by culturing the cell with the agent) or, 

35 alternatively, in vivo (e.g., by administering the agent to a subject). As such, the present 
invention provides methods of treating an individual afflicted with a disease or disorder 
characterized by aberrant expression or activity of a polypeptide of the invention. In one 
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embodiment, the method involves administering an agent {e.g., an agent identified by a 
screening assay described herein), or combination of agents that modulates {e.g., 
upregulates or downregulates) expression or activity. In another embodiment, the method 
involves administering a polypeptide of the invention or a nucleic acid molecule of the 
5 invention as therapy to compensate for reduced or aberrant expression or activity of the 
polypeptide. 

Stimulation of activity is desirable in situations in which activity or expression is 
abnormally low or downregulated and/or in which increased activity is likely to have a 
beneficial effect. Conversely, inhibition of activity is desirable in situations in which 
1 0 activity or expression is abnormally high or upregulated and/or in which decreased activity 
is likely to have a beneficial effect. 

This invention is further illustrated by the following examples which should not be 
construed as limiting. The contents of all references, patents and published patent 
applications cited throughout this application are hereby incorporated by reference. 



Deposit of Clones 

Clones containing cDNA molecules encoding human TANGO 253, (clone 
EpT253) human TANGO 257 (EpT257), human INTERCEPT 258 (clone EpT258) and 
human TANGO 281 (clone EpT 281) were deposited with the American Type Culture 
Collection, 10801 University Boulevard, Manassas, VA, 201 10-2209, on April 21, 1999 
as Accession Number 207222, as part of a composite deposit representing a mixture of 
strains, each carrying one recombinant plasmid harboring a particular cDNA clone. 

25 For this composite deposit, to distinguish the strains and isolate a strain harboring 

a particular cDNA clone, an aliquot of the mixture can be streaked out to single colonies 
on nutrient medium {e.g., LB plates) supplemented with lOOg/ml ampicillin, single 
colonies grown, and then plasmid DNA extracted using a standard minipreparation 
procedure. Next, a sample of the DNA minipreparation can be digested with a 

30 combination of the restriction enzymes Sail, Noil, Xbal and EcorV and the resultant 

products resolved on a 0.8% agarose gel using standard DNA electrophoresis conditions. 
The digest liberates fragments as follows: 

Human TANGO 253 (clone EpT253): 1.3 kb 

35 

Human TANGO 257 (clone EpT257): 1 .8 kb 
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Human INTERCEPT 258 (clone EpT258): 1 .0 kb and 0.85 kb (human 
INTERCEPT 258 has a EcorV cut site at about bp 1 004). 

Human TANGO 281 (clone EpT281): 0.9 kb and 0.9kb (human TANGO 
281 Has an Xbal cut site at about bp 900). 

5 

The identity of the strains can be inferred from the fragments liberated. 

Clones containing cDNA molecules encoding mouse INTERCEPT 258 were 
deposited with the American Type Culture Collection (Manassas, VA) on April 21, 1999 
10 as Accession Number 20722 1 , as part of a composite deposit representing a mixture of 
five strains, each carrying one recombinant plasmid harboring a particular cDNA clone. 

To distinguish the strains and isolate a strain harboring a particular cDNA clone, 
an aliquot of the mixture can be streaked out to single colonies on nutrient medium (e.g., 
LB plates) supplemented with lOO^tg/ml ampicillin, single colonies grown, and then 
plasmid DNA extracted using a standard minipreparation procedure. Next, a sample of the 
DNA minipreparation can be digested with a combination of the restriction enzymes Sail, 
and Noil, and the resultant products resolved on a 0.8% agarose gel using standard DNA 
electrophoresis conditions. The digest liberates fragments as follows: 

20 

Mouse INTERCEPT 258 (clone EpT258): 1.8 kb 
The identity of the strains can be inferred from the fragments liberated. 

25 

A clone containing a cDNA molecule encoding mouse TANGO 253 (Clone EpTm 
253) was deposited with American Type Culture Collection, 10801 University Boulevard, 
Manassas, VA 201 10-2209, on April 21, 1999 as Accession Number 207215. 

A clone containing a cDNA molecule encoding mouse TANGO 257 (Clone EpTm 
257) was deposited with American Type Culture Collection, 10801 University Boulevard, 
Manassas, VA 201 10-2209, on April 21, 1999 as Accession Number 207217. 

A clone containing a cDNA molecule encoding mouse TANGO 281 (Clone EpTm 
281) was deposited with American Type Culture Collection, 10801 University Boulevard, 
35 Manassas, VA 201 10-2209, on June 1 5, 1 999 as patent deposit Number PTA-224. 
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All publications, patents and patent applications mentioned in this specification are 
herein incorporated by reference into the specification to the same extent as if each 
individual publication, patent or patent application was specifically and individually 
indicated to be incorporated herein by reference. 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
claims. 
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MICROORGANISMS 

Optional Sheet in connection with the microorganism referred to on pages , lines of the description ' 

A. IDENTIFICATION OF DEPOSIT ■ 

Further deposits are identified on an additional sheet ' 

Name of depositary institution 4 
American Type Culture Collection 



Address of depositary institution (including postal code and country) * 

10801 University Blvd. 
Manassas, VA 20110-2209 
US 



Date of deposit * April 21 , 1 999 Accession Number ' 2072 1 5 

B. ADDITIONAL INDICATIONS ' (leave Wank if not applicable). This information it continued on a tepante attached sheet 



C. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE • c* a* « 



D. SEPARATE FURNISHING OF INDICATIONS • (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later ' (Specify the general nature of the indications e.g., 
"Accession Number of Deposit") 



E. □ This sheet was received with the International application when riled (to be checked by the receiving Office) 



(Authorized Officer) 
□ The date of receipt (from the applicant) by the International Bureau * 



was 



(Authorized Officer) 
EormPfrVRfl/iM (January 19811 : 
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What is claimed is: 

I. An isolated nucleic acid molecule selected from the group consisting of: 

a) a nucleic acid molecule comprising a nucleotide sequence which is at least 
45% identical to the nucleotide sequence of SEQ ID NO:l, 2, 26, 27, 46, 47, the cDNA 
insert of the plasmid deposited with the ATCC® as Accession Number 207222, or a 
complement thereof; 

b) a nucleic acid molecule comprising a fragment of at least 300 nucleotides 
of the nucleotide sequence of SEQ ID NO:l, 2, 15, 16, 26, 27, 46, 47, the cDNA insert of 
the plasmid deposited with the ATCC® as Accession Number 207222, or a complement 
thereof; 

c) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:3, 17, 28, 48, or the amino acid sequence encoded by the 
cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207222; 

d) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:3, 28, 48, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207222, wherein the fragment comprises at least 15 contiguous amino acids of 
SEQ ID NO:3, 28, 48, or the amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207222; 

e) a nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:3, 17, 28, 48, or the 
amino acid sequence encoded by the cDNA insert of the plasmid deposited with the 
ATCC® as Accession Number 207222, wherein the nucleic acid molecule hybridizes to a 
nucleic acid molecule comprising SEQ ID NO:2, 16, 27, 47, or a complement thereof 
under stringent conditions; 

f) a nucleic acid molecule comprising a nucleotide sequence which is at least 
95% identical to the nucleotide sequence of SEQ ID NO:21, 22, the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207217, or a complement 
thereof; 

g) a nucleic acid molecule comprising a fragment of at least 300 nucleotides 
of the nucleotide sequence of SEQ ED NO:21 , 22, the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207217, or a complement thereof; 



- 137- 



WO 00/78808 



PCT/US00/16883 



h) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:23, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207217; 

i) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:23, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207217, wherein the fragment comprises at least 360 contiguous amino acids of 
SEQ ID NO:23, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207217; 

j) a nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:23, or the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number 207217, wherein the nucleic acid molecule hybridizes to a nucleic acid 
molecule comprising SEQ ID NO:22, or a complement thereof under stringent conditions; 

k) a nucleic acid molecule comprising a nucleotide sequence which is at least 
45% identical to the nucleotide sequence of SEQ ID NO:37, 38, the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 20722 1 , or a complement 
thereof; 

1) a nucleic acid molecule comprising a fragment of at least 300 nucleotides 
of the nucleotide sequence of SEQ ID NO:37, 38, the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207221 , or a complement thereof; 

m) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:39, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 20722 1 ; 

n) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:39, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207221, wherein the fragment comprises at least 160 contiguous amino acids of 
SEQ ID NO:39, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207221; 

o) a nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:39, or the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
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Accession Number 207217, wherein the nucleic acid molecule hybridizes to a nucleic acid 
molecule comprising SEQ ID NO:38, or a complement thereof under stringent conditions; 

p) a nucleic acid molecule comprising a nucleotide sequence which is at least 
45% identical to the nucleotide sequence of SEQ ID NO:8, 9, the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207215, or a complement 
thereof; 

q) a nucleic acid molecule comprising a fragment of at least 300 nucleotides 
of the nucleotide sequence of SEQ ID NO:8, 9, the cDNA insert of the plasmid deposited 
with the ATCC® as Accession Number 2072 1 5, or a complement thereof; 

r) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO: 10, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207215; 

s) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO: 10, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207215, wherein the fragment comprises at least 15 contiguous amino acids of 
SEQ ID NO: 10, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207215; 

t) a nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO: 10, or the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number 207215, wherein the nucleic acid molecule hybridizes to a nucleic acid 
molecule comprising SEQ ID NO:9, or a complement thereof under stringent conditions; 

u) a nucleic acid molecule comprising a nucleotide sequence which is at least 
95% identical to the nucleotide sequence of SEQ ID NO: 15, 16, the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207222, or a complement 
thereof; 

v) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO: 17, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207222, wherein the fragment comprises at least 360 contiguous amino acids of 
SEQ ID NO: 17, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207222. 
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w) a nucleic acid molecule comprising a nucleotide sequence which is at least 
45% identical to the nucleotide sequence of SEQ ID NO:56, 57, the cDNA insert of the 
plasmid deposited with the ATCC® as patent deposit Number PTA-224, or a complement 
thereof; 

x) a nucleic acid molecule comprising a fragment of at least 300 nucleotides 
of the nucleotide sequence of SEQ ID NO:56, 57, the cDNA insert of the plasmid 
deposited with the ATCC® as patent deposit Number PTA-224, or a complement thereof; 

y) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:58, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as patent deposit Number PTA-224; 

z) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:58, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as patent deposit 
Number PTA-224, wherein the fragment comprises at least 15 contiguous amino acids of 
SEQ ID NO:58, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as patent deposit Number PTA-224; 

aa) a nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:58, or the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as patent 
deposit Number PTA-224, wherein the nucleic acid molecule hybridizes to a nucleic acid 
molecule comprising SEQ ID NO: 57, or a complement thereof under stringent conditions. 

2. The isolated nucleic acid molecule of claim 1 , which is selected from the 
group consisting of: 

a) a nucleic acid comprising the nucleotide sequence of SEQ ID NO: 1 , 2, 1 5, 
16, 26, 27, 46, 47, the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number 207222, or a complement thereof; 

b) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:3, 17, 28, 48, or the amino acid sequence encoded by the 
cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207222; 

c) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:2 1 , 22, 
the cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207217, 
or a complement thereof; 
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d) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:23, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207217; 

e) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:37, 38, 
the cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207221, 
or a complement thereof; 

0 a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ED NO:39, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207221; 

g) a nucleic acid comprising the nucleotide sequence of SEQ ID NO: 8, 9, the 
cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207215, or 
a complement thereof; 

h) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO: 10, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207222. 

i) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:56, 57, 
the cDNA insert of the plasmid deposited with the ATCC® as patent deposit Number 
PTA-224, or a complement thereof; 

j) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:58, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as patent deposit Number PTA-224. 

3. The nucleic acid molecule of claim 1 further comprising vector nucleic acid 
sequences. 

4. The nucleic acid molecule of claim 1 further comprising nucleic acid 
sequences encoding a heterologous polypeptide. 

5. A host cell which contains the nucleic acid molecule of claim 1 . 

6. The host cell of claim 5 which is a mammalian host cell. 
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7. A non-human mammalian host cell containing the nucleic acid molecule of 
claim 1 . 



8. An isolated polypeptide selected from the group consisting of: 

a) a fragment of a polypeptide comprising the amino acid sequence of SEQ ID 
NO:3, 10, 17, 23, 28, 39, 48, 58, wherein the fragment comprises at least 15 contiguous 
amino acids of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58; 

b) a naturally occurring allelic variant of a polypeptide comprising the amino 
acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, or the amino acid sequence 
encoded by the cDNA insert of plasmids deposited with the ATCC® as Accession 
Number 207222, Accession Number 207215, Accession Number 207217, Accession 
Number 207221, patent deposit Number PTA-224 wherein the polypeptide is encoded by 
a nucleic acid molecule which hybridizes to a nucleic acid molecule comprising SEQ ID 
NO:2, 9, 16, 22, 27, 38, 47, 57, or a complement thereof under stringent conditions; and 

c) a polypeptide which is encoded by a nucleic acid molecule comprising a 
nucleotide sequence which is at least 45% identical to a nucleic acid comprising the 
nucleotide sequence of SEQ ID NO:2, 9, 27, 38, 47, 57, or at least 98% to a nucleic acid 
comprising the nucleotide sequence of SEQ ID NO:2, 9, 27, 38, 47, 57, or a complement 
thereof. 



9. The isolated polypeptide of claim 8 comprising the amino acid sequence of 
SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58. 



10. The polypeptide of claim 8 further comprising heterologous amino acid 
sequences. 

11. An antibody which selectively binds to a polypeptide of claim 8. 



12. A method for producing a polypeptide selected from the group consisting 

of: 

a) a polypeptide comprising the amino acid sequence of SEQ ID NO:3, 10, 
17, 23, 28, 39, 48, 58, or the amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207222, Accession Number 
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207215, Accession Number 207217, Accession Number 207221, or patent deposit 
Number PTA-224; 

b) a polypeptide comprising a fragment of the amino acid sequence of SEQ ID 
NO:3, 10, 17, 23, 28, 39, 48, 58, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207222, Accession 
Number 207215, Accession Number 207217, Accession Number 207221, or patent 
deposit Number PTA-224, wherein the fragment comprises at least 15 contiguous amino 
acids of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, or the amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207222, 
Accession Number 207215, Accession Number 207217, Accession Number 207221 or 
patent deposit Number PTA-224; and 

c) a naturally occurring allelic variant of a polypeptide comprising the amino 
acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207222, Accession Number 207215, Accession Number 207217, Accession 
Number 207221, or patent deposit Number PTA-224, wherein the polypeptide is encoded 
by a nucleic acid molecule which hybridizes to a nucleic acid molecule comprising SEQ 
ID NO:l, 8, 15, 21, 26, 37, 46, 56, or a complement thereof under stringent conditions; 

comprising culturing the host cell of claim 5 under conditions in which the nucleic 
acid molecule is expressed. 

13. A method for detecting the presence of a polypeptide of claim 8 in a 
sample, comprising: 

a) contacting the sample with a compound which selectively binds to a 
polypeptide of claim 8; and 

b) determining whether the compound binds to the polypeptide in the sample. 

14. The method of claim 1 3, wherein the compound which binds to the 
polypeptide is an antibody. 

15. A kit comprising a compound which selectively binds to a polypeptide of 
claim 8 and instructions for use. 
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16. A method for detecting the presence of a nucleic acid molecule of claim 1 
• in a sample, comprising the steps of: 

a) contacting the sample with a nucleic acid probe or primer which selectively 
hybridizes to the nucleic acid molecule; and 

b) determining whether the nucleic acid probe or primer binds to a nucleic 
acid molecule in the sample. 

17. The method of claim 16, wherein the sample comprises mRNA molecules 
and is contacted with a nucleic acid probe. 

18. A kit comprising a compound which selectively hybridizes to a nucleic acid 
molecule of claim 1 and instructions for use. 

1 9. A method for identifying a compound which binds to a polypeptide of 
claim 8 comprising the steps of: 

a) contacting a polypeptide, or a cell expressing a polypeptide of claim 8 with 
a test compound; and 

b) determining whether the polypeptide binds to the test compound. 

20. The method of claim 19, wherein the binding of the test compound to the 
polypeptide is detected by a method selected from the group consisting of: 

a) detection of binding by direct detecting of test compound/polypeptide 
binding; 

b) detection of binding using a competition binding assay; 

c) detection of binding using an assay for TANGO 253, TANGO 257, 
INTERCEPT 258, TANGO 281-mediated signal transduction. 

21. A method for modulating the activity of a polypeptide of claim 8 
comprising contacting a polypeptide or a cell expressing a polypeptide of claim 8 with a 
compound which binds to the polypeptide in a sufficient concentration to modulate the 
activity of the polypeptide. 
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22. A method for identifying a compound which modulates the activity of a 
polypeptide of claim 8, comprising: 

a) contacting a polypeptide of claim 8 with a test compound; and 

b) determining the effect of the test compound on the activity of the 
polypeptide to thereby identify a compound which modulates the activity of the 
polypeptide. 
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GTCGACCCACGCGTCCCXXSACTGGG^^ 7 9 

CACCAACTGGAGGGTCCXTCAGTAGCGAGCGCCCCGAAGGAGGCGA^ 158 
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CCGGCGTCCGGGCTCCCGGTGCCAGCGCT ATG AGG CCA CTC CTC GTC CTG CTG CTC CTG GGC CTG 22 3 
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AGTGK3CTCTCTGC(1ATCA(XjTCTG^ 1156 

GGCAAGTCTAAGTCCCCCAGTTGCTCTGGTCCAGG^ 1235 

ATCCTCCCCACCCCCTCCTCCTCCTG^ 1314 
AAAAAAAAAAAAAAAGGGCGGCCGC 1339 
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kRPLLVLLLLGLAAGSPPLDDNKIPSLCPGHPGLPGTPGHHGSQGLPGRDGRDGRDGAPG 
APGEKGEGGRPGLPGPRGDPGPRGEAGPAGPTGPAGECSVPPRSAFSAKRSESRVPPPSD 
APLPFDRV^VNEQGHYDAVTGKFTCQ VPGVYYFAVHATVYRAS LQFDLVKN 
FFGGWPKPASLSGGAMVRLEPEDQVWVQVGVGDYIGIYASIKTDSTFSGFLVYSDWHSSP 
VFA 
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GTCGACCCACGCGTCCGCGCTGTGAAGCCAG 7 9 

M R P L L A 6 

CTGCCTACAGACTACAAGAGAGGTTCCTGGAGTCTGAGCCTCCGGGGTCACCACC ATG AGG CCA CTT CTT GCC 152 

I'LL LG.LVS G-SPPLD. DNKI PS 26 

CTT CTG CTT CTG GGT CTG GTG TCA GGC TCT CCT CCT CTG GAC GAC AAC AAG ATC CCC AGC 212 

L C P G Q PG L P GTPGHHGS QGL 46 

CTG TGT CCC GGG CAG CCC GGC CTT CCA GGC ACA CCA GGT CAC CAT GGC AGC CAA GGC CTG 272 

PGR D G R D G R DG A P G A PG E K G 66 

CCT GGC CGT GAC GGC CGT GAT GGC CGC GAC GGT GCA CCC GGA GCT CCG GGA GAG AAA GGC 332 

EGGRP GLPGPR GEPGPRGEA 86 

GAG GGC GGG AGA CCG GGA CTA CCT GGC CCA CGT GGG GAG CCC GGG CCG CGT GGA GAG GCA 392 

GPMGAXGPAGECSVP PR SAF 106 

GGG CCC ATG GGG GCT ATC GGG CCT GCG GGG GAG TGC TCG GTA CCC CCA CGA TCA GCC TTC 4 52 

SAKRSESRVPPPADTPLPFD 126 

AGT GCC AAG CGA TCC GAG AGC CGG GTA CCT CCG CCA GCC GAC ACA CCC CTA CCT TTC GAC 512 

R V L L N E Q G H Y D P T T G K F T C Q 146 

CGT GTG CTG CTA AAT GAG CAG GGC CAT TAC GAC CCC ACT ACT GGC AAG TTC ACC TGC CAA 572 

VPGVYYF AVHATVYRASLQF 166 

GTG CCT GGC GTC TAC TAC TTT GCT GTG CAC GCC ACT GTC TAC CGG GCC AGC TTG CAG TTT 632 

D L V K N G Q S I A S F F Q Y F G G W P 186 

SAT CTT GTC AAA AAC GGG CAG TCC ATC GCC TCT TTC TTC CAG TAT TTT GGG GGG TGG CCC 692 

KPASLSGGAMV RLEPEDQVW 206 

\AG CCA GCC TCG CTC TCA GGG GGT GCG ATG GTA AGG CTA GAA CCT GAG GAC CAG GTG TGG 752 

VQVGVGDY IGIYAS. IKTD ST 226 

3TG CAG GTG GGC GTG GGT GAT TAC ATT GGC ATC TAT GCC AGC ATC AAG ACA GAC AGT ACC 812 

FS G F I* V Y ' S DWHSS PV FA* 244 

*TC TCT GGA TTT CTC GTC TAT TCT GAC TGG CAC AGC TCC CCA GTC TTC GCT TAA 866 

AACACAGTG^CXrCGGAGCTGGCACTT^ 945 

TGGCCCCCTGGAATATTGTGAATGACTTAGGAAGA 1024 

•GCTGTCTGAGGTC^GACAGOG^ 1103 

TGGGTCCTCGCCCAGGACTC^ 1182 
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GCTGCTCCCA.GOGCA.GG C CTTTTTCTCAGAGGTCACTTAATAAAC CTAAAATCXTCAAAAAAAAAAAAAAAGGGCGGC C 1261 
GC 1263 
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>wT253 

MRPLLALLLUSLVSGSPPLDDNKIPSUTPGQPGL^^ 

APGEKGEGGRPGLPGPRGEPGPRGEAGPMGAIGPAGECSVPPRSAFSAKRSESRVPPPAD 
TPLPFDRVLLNEQGHYDPTTGKFTCQVPGVYYFAVHATVYRASLQFD 
YFGGWPKPASLSGGAMVRLEPEDQVV7VQVGVGDYIG I YAS I KTDSTFSGFLVYS DWHS S P 
VFA 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Killer, CABIOS C19B9) 

> hT2S3 a. a. 243 aa vs. 

> raT253 a. a. 243 a a 
scoring matrix; paml20.mat, gap penalties: -12/-4 

93.8% Identity; Global alignment score: 1239 

10 20 30 40 SO 60 70 

inputs MRPIXVMAU;LAACSPPU>DmaPSLCPGHPGLPGTO 

:::::.::::::..::::::::::::::::.:;::::::;;:::::;;::;:;:;::;;::;::;:;::: 
MRPLLALIXUJLVSGSPPLDDNKIPSLCF^ 

10 20 30 40 50 60 70 

80 90 100 110 120 130 140 

inputs PCLPGPRGDPCPRGE^GPAGPTGPACECSVTPRSArSAKRSESRV^ 

::::..::.::::::::: : . . • • . . . . ..».••»■ ....... • 

PGLPGPRGEPGPRGEAGPKGAIGPAGECSVPPRSATSAWISESRN^PPADTPLPFDRVLLNEOGHYDPTT 
80 90 100 110 120 130 140 

ISO 160 170 180 190 200 210 

inputs GKFTCQVPGVYYFAVHATVYRASl^FDLVKNGE 

::::::::::::::::::::::::::::::::.:::»:::.:::::::::::::::;::;;;:::::::: 
G KFTCQVFCVYY F A VHATVYRAS LQ Ft> L VKHGQ S I AS FFQ Y FGGWP KP AS LSGGAMVRL E P EDQVWVQVG 
150 160 170 180 190 200 210 

220 230. 240 

inputs VGDYIGI YAS IKTDSTFSGFLVYSDWHSS PVFA 

VGOYZGIYAS I KTDSTFSG FLVYS D WHS S PVFA 
220 230 240 



FIG. 5 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> hT253 a. a. 243 aa vs * 

> SwissProt Q1584B - (untitled) 244 aa 
scoring matrix: paml20.mat, gap penalties: -12/-4 

38.7% identity; Global alignment score: 262 

10 20 30 40 50 60 

inputs MRPL-LVLLLLGLAA- - -GSPPLDDNKIPSL CPG - HPGLPGTPGHHGSQGLPGRDGRDGRDGAPGA 

1 . . . • • : :: .:.:::::•:•• : :::: : :.:. 

f^Ll/3A^LUALPG 

10 20 30 40 SO 60 70 

70 80 90 100 110 120 130 

inputs PGEKGEC^RPGLPGPRGDPGPRGEAGPAGPTGPAGECSVPPRSAFSAKRSESRVPPPSDAPLPFTDRVLVN 

: : :.::::.: : : . : : :::::. : - : 

PGLIG PKGDIGETGVPGAEGPRGFPGIQGRKGEPGEGAYVYRS AFS VGL- ETYVTI P - NMP I RFTKI FYN 

60 90 100 110 120 130 

140 ISO 160 170 180 190 200 

inputs EQGHYDAVTGKFTCQVTGVYYFAVHATVYRASLQro IAS FFQFFGGWPKPASLSGGAMVRLEP 

• ... •••• : ... •: :•• • • * ••••••••* 

QQNHYDGSTGKJra^IPGL'WFAYH 
140 150 160 170 180 190 200 

210 220 230 240 

inputs EDQVWVQ V - G VGD Y IG I YAS I KTDS T FSG FLVYS D WHS S PVFA 
•••« • • • ::•:•:::•• :•• • 

G^VWLQWGEGERNGLYADmDNDSTF^^ HDT N 

210 220 230 240 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CAB10S (1989) 

> mT253 a. a. 243 aa vs. 

> SwissProt Q15848 - (untitled) 244 aa 
scoring matrix: paml20.mat, gap penalties: -12/-4 

38.3% identity; Global alignment score: 264 

10 20 30 40 50 60 

inputs MRPLLALLLLGLVSGSPPLDDNKIPSL - -CPG-QPGLPGTPGHHGSQGLPGRDGRDGRDGAPGA 

• ••••• m ■« * • •••■»■* • •••« « » • 

• • ••••• • a a • • • » * » • ■ »»•♦••• • % ■ B ■ 

MLLLGAVXiLLUVLPGHIXiETTTQGPGVl^LPLPKGACTGWMAG 

10 20 30 40 50 60 70 

70 80 90 100 110 120 130 

inputs PGEKGEGGRPGLPGPRGEPGPRGEAGPMGAIGPAGECSVPPRSAFSAKRSESRVPPPADTPLPFDRVLLN 

PGLIGPKGDIGETGVPGAEGPRGFPGIQGRKGEPGEGAYVTRSAFSVGL-ETYVTIP-NMPIRFTKIFTN 
80 90 100 110 120 130 

140 150 160 170 180 190 200 

inputs EQGHYDPTTGKFTCQVT**vY¥FAV 

••*• ••••• mm ;»...•• 

• ••••• •••••••••• • • • • ••* • a •• • • ■ •••••»»•• 

<X}NHYIXjSTGKF7ICNIPGLYYFAYHITVYMKDVXVSLFKK^ 
140 150 160 170 180 190 200 

210 220 230 240 

inputs edqvwvqv-gvgdyigiyasiktdstfsgflvysdwhsspvfa 

• ••••••• • * • •••■•«•■ • 

GDQVWLQVYGEGERNGL YADNDNDS TFTG FLLY HDT N 

210 220 230 240 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cites Myers and Miller, CABIOS (1989) 

> HT253 n.a. 1339 aa vs. 

> adipocyte n.a. (AI417523) 653 aa 
scoring matrix: para 120. mat, gap penalties: -12/-4 

29.1% identity; Global alignment score: -1168 

10 20 30 40 50 60 70 

inputs GTCGACCCACGCGTCC^GGACTGGGGTGACGCCAGOGCAGGOGGCGCCTGGCCGGGGAGAAGCGCGGGGG 
: . ::.: : .::: :.. :. :::: :* t..i :: ::. 

TTTTTT GCAT — GTAACTTTTTTATTGA GGCA C AACAAGGC ATTGT AACTTG CCTGG A 

10 20 30 40 50 

60 90 100 110 120 130 140 

inputs CTGGAGCACCACCAACTGGAGGGTCCGGAGTAGCGAGCGCCCCGAAGGAGGCCATCGGGGAGCCGGGAGG 

. • . . a *• . • ■ ••• • • • T * _ * • ••••• 

CTTGAG GCAGT CAGTTTAGTAAGCT GAA< CGTTAATACAGTTAA 

60 70 80 90 

150 160 / 170 180 190 200 210 

inputs GGGGACTGCGAGAGGACCCCGGCGTCCGGGCTCCCGGTGCCAGCGCTATGAGGCCACTCCTCGTCCTGCT 
::..•: : : : : . : . 

GGATTAAG TGCAAACAATATA CATTC AC A 

100 110 120 

220 230 240 250 260 270 280 

inputs GCTCCTGGGCCTGGCGGCCGGCTCGCCCCCACTGGACGACAACAAGATCCCCAGCCTCTGCCCGGGGCAC 
::: ::• ::.:::. :::: ::..:: ::: .: :: an • • 

GCT — TGA — CTAGCGA — GGCT ACATCA— CAATTTATAAAG TGCCAGA 

130 140 150 160 170 

290 300 310 320 330 340 350 

inputs CCCGGCCTTCCAGGCACGCCGGGCCACCATGGCAGCCAGGGCTTGCCGGGCCGCGATGGC0GCX5ACGGCC 



XX — AGT GCTAA TTGTCATTCA — GCTTG ATTTTTCAC 

180 190 200 

360 370 380 390 400 410 420 

inputs GTC ACGG CGTC CCCGGGGCTCCGGG AG AGAAAGGCGAGGGCGGG AGG 

::: ::•::..::: :.:... t .::: .** 

— CTCAGGAAGGAAAA — CAAAAAAGTAAGG ACC TCCTC 

210 220 230 

430 440 450 460 470 480 490 

inputs GGACCCCGGGCCGCGAGGAGAGGCGGGACCCX3CGGGGCCC& 
1 1 : it i s • 

— CCTCTAGGAA- — ~ 

240 

500 510 520 530 540 550 560 

inputs CCGC«ATCCGCCTTCAGCGCCA^ 

:.:.:.:. : : : : 
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CAAAAAACATTTTCCT AAACCAA 

250 260 270 

570 580 590 600 610 620 630 

inputs TCGACCGCX5TGCTGGTGAACGAGCAGCGACATTACGACGCCGTCACCGGCAAGTTCACCTGCCAGGTGCC 
5: *« 2 ::: . •:• . : : ::. 

TCAGTC - — ATGA — GGGCAAAGAC — TACTTTTCCTTCA ATC-CCA — CTAAT 

280 290 300 , 310 



640 650 660 670 680 690 700 

inpu t s TGGGGTCTACTACTTCGCCGTCCATGCCACCGTCTACCGGGCCAGCCTGCAGTTTGATCTGGTG AAGAAT 



TAGAA CACCATCC TTTTAT T 

320 330 

710 720 730 740 750 760 770 

inputs GGCGAATCCATTGCCTCTTTCTTCCAGTTTTTCGGGGGGTGGCCCAAGCCAGCCTCGCTCTCGGGGGGGG 

GTCAATACTGT ACTGACTTTCAAT CTTG 

340 350 360 



780 790 800 810 820 830 840 

inputs CCATGGTGAGGCTGGAGCCTGAGGACCAAGTGTGGGTGCAGGTGGGTGTGGGTGACTACATTGGCATCTA 

: • 

— AT AAAG AAG AT — AG CCTG AAAAC GTAGAATAT 

370 380 390 

850 860 870 880 890 900 910 

inputs TGCCAGCATCAAGACAGACAGCACCTTCTCCGGATTTCTGGTGTACTCCGACTGGCACAGCTCCCCAGTC 

• ••••••• 

TTCCAGCTACT TCCATAAAT TGCTCCCCTGT- 

400 410 420 



920 930 940 950 960 970 980 

inputs TTTGCTTAGTGCCCACTGCAAAGTGAGCTCATGCTCTCACTCCTAGAAGGAGGGTGTGAGGCTGACAACC 

GCAGACGT — 
430 

990 1000 1010 1020 1030 1040 1050 

inputs TGGTCATCCAGGAGGGCTGGCCCCCCTGGAATATTGTGAATGACTAGGGAGGTGGGGTAGAGCACTCTCC 
t.:x: :ts: t nststu : 

7 AACCATAT CTGGTCTCCCTGGAA -GAGCTGAAGAATTGCATGAT — 

440 450 460 470 

1060 1070 1080 1090 1100 1110 1120 

inputs GTCCTGCTGCTGGCAAGGAATGGGAACAGTGGCTGTCTGOGAT 

*«**•»** i ..tt tit tttt tt tt z tt ttt tt it «nn 

TGCTAGCA -GTTTCA-TGG -TCTG-GAGCA C CATCATTGG-CATAGGCT 

480, 490 500 510* 520 

1130 1140 1150 1160 1170 1180 1190 
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input 8 GGATTTCTGCCCAAGACCAGATOAGTGTGCTGTC 

insists* :t s::i ::: 

GAT A ■ CCAAGACCT -CTT — CATTCTTCANTGAG GTTG-AC — ATACAG 

530 540 550 560 

1200 1210 1220 1230 1240 1250 1260 

inputs GAGCCCACGGTGGGGTGCTCTCTTCCTGGTCCTCTGCTTCTCTGGATCCTCCCCACCCCCTCCTGCTCCT 
•*s 2: : • : : ♦ 

TGGCACAT TCACTGCCAG — CTTTTACATGTGAAAAA TGAAAAACGT 

570 580 590 600 



1270 1280 1290 1300 1310 1320 1330 

inputs GGGGCCGGCCCTTTTCTCAGAGATCACTCAATAAACCTAAGAACCCTCCAAAAAAAAAAAAAAAAAAAAG 
• : •••• .: :: ;:: ::: 

AGTGCCA TTCACTTGG — CA ATTAAATCTA CCAAAGCTGAGATCAAA — 

610 620 630 640 650 



inputs GGCGGCCGC 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller , CABIOS (1989) 

> roT253 n.a. 1263 aa vs. 

> adipocyte n.a. (AI417523) (553 aa 
scoring matrix: paml20.mat, gap penalties: -12/-4 

30,4% identity; Global alignment score: -840 

10 . 20 30 40 50 . 60 70 

inputs GTCGACCCACGCGTCCGCGCTGTGAAGCCAGCAAGGAGCAACCAGAAGCTAGGAGTCAGTCAGCAAGGAC 
: : : .:::.: :•:.. .. ::: :: ::.::::: : 

TT TTT TGCATGTAACTT TTTTATTGAGGCA — CAACAAGG-C 

10 20 30 

80 90 100 110 120 130 140 

inputs AGGGGCTGCCTGCCTACAGACTACAAGAGAGGTTCCTGGAGTCTGAGCCTCCGGGGTCACCACCATGAGG 

• • ••••• ••••••• ••••••» 

ATTG TAACT TGCCTGGA CTTGAGG 

40 50 60 

150 160 170 180 190 200 210 

inputs CCACTTCTTGCCCTTCTGCTTCTGGGTCTGGTGTCAGGCTCTCCTCCTCTGGACGACAACAAGATCCCCA 



C AG TCAGTTT AGTAAG CTG AACGTTAATA 

70 80 90 

220 230 240 250 260 270 280 

inputs GCCTGTGTCCCGGGCAGCCCGGCCTTCCAGGCACACCAGGTCACCATGGCAGCCAAGGCCTGCCTGGCCG 

• _ • • • ! _ • ••••• • • • • •••>•«••• 

— CAGTTA — AGGA TTAAGTGCAAACAATAT ACATTCACAGCTTGACTAGC-G 

100 110 120 130 140 

290 300 310 320 330 340 350 

inputs TGACGGCCGTGATGGCCGCGACGGTGCACCCGGAGCTCCGGGAGAGAAAGGCGAGGGCGGGAGACCGGGA 

* i _ f _ f • • • •••» •••• 

AGGCTAC « ATCACAATTTATAAAGTGC CAGATTA GTG 

• 150 160 170 

360 370 380 390 400 410 420 

inputs CTACCTGGCCCACGTGGGGAGCCCGGGCCX5CGTGGAGAGGC 

ss: it : «:• it ::•. • si ::. tt ..:•• 

CTAATTGTCATTCA : GCTTGATTTTTCA CCTCAGGAA -GGAAAACAA 

180 190 200 210 220 

430 440 450 460 470 480 490 

inputs GGGAGTGCTCGGTACCCCCACGATCAGCCTTGAGTGCCAAGC 

•••:*:. •::::::•: si: *•::*:.:•*•!•*• :.::: 

AAAAGTA AGGACCTCCTC -OCT CTAG-GAACAAAAAAC-ATTTTCCTA 

230 240 250 260 

500 510 520 530 540 550 560 

inputs CGA»C^CCCCTACCTTTCGAC^ 

•:::••*:•• :•::•: ::.::•• ..:• tt.t t.t ::::::: : «n 
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AACCAATCAGTCATGAG-GGCAAAGACTACTTTTCCTT--CAATCCCACTAAT"-TAG 

270 280 290 300 310 

570 580 590 600 610 620 630 

inpu 1 6 TTCACCTGCCAAGTGCCTGGCGTCTACTACTTTGCTGTGCACGCCACTGTCTACCGGGCCAGCTTGCAGT 

:. : : 

AACACCATCCTTTTA — TTG TCAATACTGT ACTGACTT 

320 330 340 350 

640 650 660 670 680 690 700 

inputs TTGATCTTGTCAAAAACGGGCAGTCCATCGCCTCTTTCTTCCAGTATTTTGGGGGGTGGCCCAAGCCAGC 



TCAATCTT GATAAAGAAGATAGCC 

360 370 

710 720 730 740 750 760 770 

inputs CTCGCTCTCAGGGGGTGCGATGGTAAGGCTAGAACCTGAGGACCAGGTGTGGGTGCAGGTGGGCGTGGGT 

: : . : 

TGAAAACGTAGAA TATTTCCAG C TAG 

380 390 400 

780 790 800 810 820 830 840 

i npu t s G ATTACAT TGGCATCTATGCCAGCATCAAG ACAGACAGTACCTTCTCTGGATTTCTCGTCTATTCTGACT 



• • • i 



— TTCCATAAATTGCT CC — C — CTGTGCAGACGTAACCATATCTGG — -TCTC — C CT 

410 420 430 440 450 

850 860 870 880 890 900 910 

input s GGCACAGCTCCCCAGTCTTCGCTTAAAACACAGTGAACCCGGAGCTGGCACTTGCTCCTCAGTGGAGGGT 
!J * !2. *: : : ::: 

GG AAGAGCTGA — AG AATT-G C ATG ATT GCTAGCAGTTTC ATGGT 

460 470 480 490 

920 930. 940 950 960 970 980 

inputs GTGACACTAACCCGCGCAGCGCATACCAGGAGGGCTGGCCCCCTGGAATATTGTGAATGACTTAGGAAGA 

: . s : s :::: 

CTGGA -GCACC ATCATTGGCATAGGCTGA 

500 510 520 

990 1000 1010 1020 1030 1040 1050 

inputs GAGGGAGCCACTTCCAGTCCCACTGCTGGCAATGAATGGAGACAGGCTGTCTC 

t.tix. tt.it t. t.. t:. tttttt .lit: 

TACCAAGACCTCTTCATTCTT -CAM- TGAGGT — TGACA — 

530 540 S50 

1060 1070 1080 1090 1100 1110 1120 

inputs AGCAGTGGCTGGGTTTCTGCCCAGGACTTTAGAATGCAGTAG 

••ssitsss. . : .ttsis *«ssss t.tt in:... tt. 

TACAGTGGCACATTCACTGCC AGCTTT TACA TGTGAAAAATGA AAAA 

560 570 560 590 600 

1130 1140 1150 1160 1170 1180 1190 
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input 8 CTCCAAGGTGGGATGCTCCATTCCTAGTCCTGTGTCCCCTCTAGGTCCCTGACTCCATCTCTGCTGCTCC 
: .::.:t • •* 

C GTAGTG CCATTC ACTTGG CAAT TAAATCTAC 

610 620 630 

1200 1210 1220 1230 1240 1250 1260 

inpu t S CAGGGCAGGCCTTTTTCTCAGAGGTCACTTAATAAACCTAAAATCCTCAAAAAAAAAAAAAAAGGGCGGC 

::: ::::: 

CAAAGCTG AGA TCAAA , 

640 650 



inputs CGC 
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GTCGACCC^CXSCGTCCGCGGACGCGTGCOTGAGGGGAAGAGGCTC 7 9 

MGPSTPLLILFLLSW S G 17 

AGGCTGCC ATG GGG CCC AGC ACC CCT CTC CTC ATC TTG TTC CTT TTG TCA TGG TCG GGA 138 

PLQGQQHHLVEYMERRLAAL 37 

CCC CTC CAA GGA CAG CAG CAC CAC CTT GTG GAG TAC ATG GAA CGC CGA CTA GCT GCT TTA 198 

EERLA'QC. QDQSSRHAAELRD 57 

GAG GAA CGG CTG GCC CAG TGC CAG GAC CAG AGT AGT CGG CAT GCT GCT GAG CTG CGG GAC 258 

FKNKMLP LLEVAEKER EAL R 77 

TTC AAG AAC AAG ATG CTG CCA CTG CTG GAG GTG GGA GAG AAG GAG CGG GAG GGA CTC AGA 318 

TEADTI S GRVDRLEREVDYL 97 

ACT GAG GCC GAC ACC ATC TCC GGG AGA GTG GAT CGT CTG GAG CGG GAG GTA GAC TAT CTG 378 

E T Q N P A L P .C 
GAG ACC CAG AAC CCA GCT CTG CCC TGT 

GT KG KG R RN KYDMVTDCGY 137 

GGG ACC AAA GGC AAG GGA AGA AGG AAT GAG AAG TAC GAT ATG GTG ACA GAC TGT GGC TAC 498 

T I S Q V R S M K I L K R F G G P A G L 157 

ACA ATC TCT CAA GTG AGA TCA ATG AAG ATT CTG AAG CGA CTT GGT GGC CCA GCT GGT CTA 558 

WT KD P L G QTEK I YVLDGT Q N 177 

TGG ACC AAG GAT CCA CTG GGG CAA ACA GAG AAG ATC TAC GTG TTA GAT GGG ACA CAG AAT 618 

DTAFVF P RLRDF TLAMAA R K 197 

GAC ACA GCC TTT GTC TTC CCA AGG CTG CGT GAC TTC ACC CTT GCC ATG GCT GCC CGG AAA 678 

AS RVRVP FPHVGTOQLVY.GG 217 

GCT TCC CGA GTC CGG GTG CCC TTC CCC TGG GTA GGC ACA GGG CAG CTG GTA TAT GGT GGC 738 

F I# Y F A R R PPG. RP GGG G.EM E N 237 

TTT CTT TAT TTT GCT CGG AGG CCT CCT GGA AGA CCT GGT GGA GGT GGT GAG ATG GAG AAC 798 

TliQLIK FHLANRTVVDSSVF 257 

ACT TTG *CAG CTA ATC AAA TTC CAC CTG GGA AAC CGA ACA GTG GTG GAC AGC TCA GTA TTC 858 

PAEGL I P PYGLTAD TY I D Ii A 277 

OCA GCA GAG GGG CTG ATC CCC CCC TAC GGC TTG ACA GGA GAC ACC TAC ATC GAC CTG GCA 918 

A D E E G L W A V Y A T R E D D R H L C 297 

GCT GAT GAG GAA GGT CTT TGG GCT GTC TAT GCC ACC CGG GAG GAT GAC AGG CAC TTG TGT 978 

I*AKIjDPQTLDTEQQ WDTPC P 317 
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CTG GCC AAG TTA GAT CCA CAG ACA CTG GAC ACA GAG CAG CAG TGG GAC ACA CCA TGT CCC 1038 



FVICGTLYVVYN_T 337 

TTT GTC ATC TGT GGG ACC CTC TAT GTC GTC TAT AAC ACC 1098 

RIQC SFDASGTLTP 357 

CGC ATC CAG TGC TCC TTT GAT GCC AGC GGC ACC CTG ACC CCT 1158 



R E N A E A A 

AGA GAG AAT GCT GAG GCT GCC 

R P A S R A 
CGT CCT GCC AGT CGG GCC 

E R A A L P Y 
GAA CGG GCA GCA CTC CCT TAT 

N P R E R Q L 
AAC CCC CGA GAA CGC CAG CTC 



F P R R Y G A 
TTT CCC CGC AGA TAT GGT GCC 



H A S L R Y 377 
CAT GCC AGC CTC CGC TAT 1218 



YAWDDGYQIVYKL 397 
TAT GCC TGG GAT GAT GGC TAC CAG ATT GTC TAT AAG CTG 1278 



EMRKKEE E V * 407 
GAG ATG AGG AAG AAA GAG GAG GAG GTT TGA 1308 

GGAGCTAGCCTTGTTTTTT G CATCTTTCrCACTCCCATAC^ 1387 

CTTCAAATGTGGGCCAGTTGTGGCrCAAATCCTC 1466 

TCATACGGAACTCCAGATCCTGAGTAATCCTTTTAGAGCCCGAAGA 154 5 

GCCCCATGTCAACAAATTTCAGGCTAAGGATGCCCCAGACCCA 1624 
AGGCAGCAGTGTTCTTCCCCTCAG AGTG ACTTGGGGAGGGAGAAATAGGAGGAGACGTCCAGCTCTGTCCTCTCTTCCT 1703 

CACTCCTCCCTTGAGTGTCCIX3AGGAACAGGACTTTCT 1782 

ATCCACTGCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGC 1831 
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, i , • , i j i , , | • , , , , , i , . , i j • | . j 1 1 1 1 • i « i • i • i • i 



1 41 81 121 161 201 241 281 321 3G1 401 



MGPSTPLLILFLLSWSGPLQGQQHHLVEYMERRLAALEERLAQCQDQSSRHAAELRDFKN 
KMLPLLEVAEKEREALRTEADTISGRVDRLEP^VDYLF/TQNPALPCVEFDEKVTGGPGTK 

gkgrrnekydmvtix:gytisqvrsmkilkrfggpaglwtkdplgotekiyvldgtqndta 
fvfprlrdftlamaarkasrvrvpfpwvgtgqlvyggflyfarrppgrpggggementlq 
likfhlanrtvvdssvfpaeglippygltadtyidlaadeeglwavyatreddrhlclak 
ldpqtldteqqwdtpcprenaeaafvicgtlywyntrpasrariqcsfdasgtltpera 
alpyfprrygahaslrynprerqlyawddgyqivyklemrkkeeev 
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MGPSA PLLLLFF 12 

GTCGACCCACGCGTCCGACTTAAGGCTGCC ATG GGG CCC AGT GOT CCT CTG CTG CTC CTC TTC TTT 66 

LS WTGPLQGQ QHKLVEY MER 32 

TTG TCA TGG ACG GGA CCC CTT CAG GGA CAG CAG CAC CAC CTT GTG GAG TAC ATG GAA CGC 126 

RLAALEERLAQCQDQ'SSR HA 52 

CGA CTA GCT GCC TTA GAG GAA CGG CTG GCC GAA TGC CAG GAT CAG AGT AGT CGG CAT GCT 186 

A ELRDFKNKMLPLLEVAEKE 72 

GCC GAG CTT CGG GAC TTC AAA AAC AAG ATG TTG CCT CTC CTG GAG GTG GCA GAG AAG GAG 2 46 

R E T L R TEAD S I S G RVD R L E R 92 

CGG GAG ACC CTC AG A ACT GAA GCA GAC TCC ATC TCA GGA AG A GTG GAC CGT CTT GAA AGG 306 

EVDYLE TQNPALP CVE D E K 112 

GAG GTA GAC TAT CTG GAG ACA CAG AAC CCA GCT TTG CCC TGT GTA GAG CTG/ GAT GAG AAG 3 66 

VTGGPGA^KGKGR RNEKYDMV 132 

GTG ACT GGA GGT CCT GGA GCC AAA GGC AAG GGC CGA AGA AAT GAG AAA TAC GAT ATG GTG 4 26 

T D C S. YTV AQVR SMKILKRF G 152 

ACG GAC TGT AGC TAC ACA GTC GCT CAG GTG AGG TCA ATG AAG ATC CTG AAG CGG TTT GGT 4 86 

G S j G L W T K D P L G P A ' E K I Y V L 172 

GGT TCA^/'Gt^ GGC CTA TGG ACC AAG GAT CCG CTG GGG \CCA GCA' GAG AAG ATC TAC GTG TTA 546 

DGTQNDT AFVFPRLRDFTLA 192 

GAC GGC ACC CAG AAC GAC ACG GCT TTT GTC TTC CCA AGG CTG CGT GAC TTC ACC CTT GCC 606 

MAA R KA S R I RVP F P W V G T G Q 212 

ATG GCT GCC CGG AAA GCT TCC CGA ATT CGG GTG CCC TTC CCC TGG GTA GGC ACG GGG CAG 666 

LVYG'G F LYYARR P PGG PGGG 232 

CTG GTG TAC GGT GGC TTC CTT TAT TAT GCT CGA AGG CCT CCT GGA GGA CCT GGA GGG GGT 726 

GBLENT I.QLIKFHL .ANRTVV 252 

GGT GAA TTG GAG AAC ACT CTG CAG CTG ATC AAA TTT CAC TTG GCA AAC CGA ACA GTG GTG 786 

D S 8 V F P A E S h I P P Y G L T A D T 272 

GAT AGC TCA GTG TTC CCT GCA GAG AGC CTG ATA CCC CCC TAC GGC CTG ACA GCA GAT ACA 846 

YID LAAD EEGLWAVYATRDD 292 

TAT ATC GAC CTG GCA GCT GAT GAG GAG GGC CTG TGG GCT GTC TAT GCC ACT CGA GAT GAT 906 

D R H L C LA K L D P Q T I* D T E Q Q W 312 

GAC AGG CAT TTG TGT CTA GCC AAG TTA GAC CCA CAG ACA CTT GAC ACA GAG CAG CAG TGG 966 

D TPCPRENAEAAF VICGTL Y 332 
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GAC ACA CCA TGT CCC AGA GAG AAC GCA GAG GCT GCG TTT GTC ATC TGT. GGG ACC CTG TAC 1026 

VV YNTRPAS RAR I Q CS F D A "~S 352 

GTT GTC TAT AAC ACC CGC CCT GCC AGT AGG GCT CGT ATT CAG TGT TCC TTC GAT GCC AGT 1086 

GTLAPERAALSYFPRRYG. AH 372 

GGT ACT CTC GCC CCT GAA AGG GCA GCA CTC TCC TAT TTT CCA CGC CGA TAT GGT GCC CAT 1146 

A S L RYNPR E RQLY A WDD G YQ 392 

GCC AGC CTT CGC TAT AAC CCC CGT GAG CGC CAG CTG TAT GCC TGG GAT GAT GGC TAC CAG 1206 

I VY KLEMK K KEE E V * 407 

ATT GTC TAC AAA TTG GAG ATG AAG AAG AAG GAG GAG GAA GTT TAA 1251 

GCAGCTAGCCTTGTGCTCTTGATTCTTATGCCCAGACATTTATA 13 30 

CGAAGGCCAGTGGTGGTAGCTCATATACCCTAATTTCTAAAGGACA^ 1409 

ACTCCAG^TCCTGGGTAGCATTTTAGAACTGAAGAGCAAA 14 88 

AGTTTAGTTCCAAACTCAGAGCCCTGTCCTTTGGAGAGGGTCA^ 156 7 

TATGACCGAAGGGAGAGAACTCAGAGACAAAGCTGCCCTCCCT 1646 

CCGCACATCACTTTGTATGGTAACAGTTTGCATTAAAAGGAA 1721 
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r.« Htm t.iti 



(I , . | I | « , , , I , . , . , I , . , I | I | • | I , I , • , I | • | I | I | 

1 41 81 121 161 201 241 281 321 381 401 

>mT257 

MGPSAPLLLLFFLSVrrcPLQGQQHHLVEYMERRLA 

KMLPLLEVAEKERETLRTEADS I SG RVDRLEREVD YLETQN P ALPC VELDEKVTGG PG AK 

GKGRRNEKYDMVTDCS YTVAQVRSMKI LKRFGG S VGLWTKDPLG PAEKI YVLDGTQNDTA 

FVFPRLRDFTLAMAARKASRIRVPFPWGTGQLVYGGFLYYARRPPGGPGGGGELE^W 

LI KFHLANRTWDSSVFP AESLI P P YG LTADTYIDLAADEEGLWAVYATRDDDRHLCLAK 

LDPQTLDTEQQVTDTPCPRENAEAAFVICGTLYV^ 

AIiS Y FPRR YG AHAS LRYN PRERQ L YAWDDG YQ I VYKLEMKKKE EEV ' 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPleaae cltei Myers and Killer, CABIOS (1989) 

> hT257 a. a. 406 aa vs. 

> mT257a.a. 406 aa 
scoring matrix s paml20.mat, gap penalties: -12/-4 

94.1% Identity; Global alignment score t 2097 

10 20 30 40 50' 60 70 

inputs KGPSTPLLILFLLSWSGPWGTOKHLVEYMERRLJ^ 

::::•:::. :<:::::::::::::::::::: 
HGPSAPiXLLFFLSWTCPLQGOQHHLVEYME^^ 

10 20 30 40 SO 60 70. 

80 90 100 110 120 130 140 

inputs KEREAtJlTEAOTISGRVDRLEREVDYL 

iiSS«*s*«!*»« * i **•*«••«• , ji#*it»SJ'J»»»*»it»» 

KERTITJITEADS I SGRVDRLEREVD YLETQNPALPCVXLDEKVT^ 

80 90 100 110 120 130 140 

ISO 160 170 180 190 200 210 

inputs OVRS M K I LKRFGG PAG LWT KD PLGOT E KI YVLDGTQNOT AFVT PRIJtDFTLAMAARKAS RVR VP F P WVGT 

I::::::::::::..:::::::::..::::;::::::;::::::::::::::::::::::.::::::::: 
OVRSMKILK3lFGGSVGLVn*KDPI^PAEiaYVUXnQKOT 

ISO 160 170 180 190 200 210 

220 230 240 2S0 260 270 280 

inputs GQLVYGGFLYFARRP PGRPGGGG EMEKTLQL X KFHtAKRTVVDS SVFP AEGL I P PYGLTADTYI DLAADE 

GQLVYGG FL YY ARRP PGG PGGGG E L EOTLQL X KFHLMfRTWDS SVF P AES L I P P YG LT ADTY I DLAADE 
220 230 240 250 260 270 280 

290 300 310 320 330 340 3S0 

inputs EGLWAVYATREDDRKLC1AKX*DPQ11J>TEQQWOT 

::::::::::.:::::::::::::::::::::::::::::::::::::::::::::::::::::•*-:::: 
EG L WAVYATRDDDRHLCLAKLD PQTLDTEQQWDT PCPREKAEAAFVT CGTLYWYNTRPASRARI QCS FD 
290 300 310 320 330 340 350 

360 370 380 390 400 

inputs ASGTLTPERAALPYFPRRYGAHASLAYKPRERQLYAHDDGYQIVYiaEKRKKEEEV 
r it::.:::::;.::::::;::::::::::::::::;;;::::::::::.::::: : 
ASGTLAPERAALS YFPRRYG AHAS LRYNPRERQLYAWDDGYQ IVYKLEMICKXEEEV 
360 370 380 390 400 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite* Myers and Killer, CAB10S (1969) 

> hT257 a,e. 406 aa vs. 

> Patent Protein W75120 - (untitled) 355 aa 
scoring matrix t paml20.nat, gap penalties: -12/-4 

86.9% Identity; Global alignment score: 1681 

10 20 30 40 50 60 70 

inputs MGPSTPLLILFIXSWSGPUW<X)HHLVEYME^^ 

x J : t :: J J::!::::::::::::::::::::::::::;;;::,;:::::::;;::::::::::;::::: : 
HGPSTPLLILFU^WSCPLQGTOHHLVEYMERRU^ 

10 20 30 40 50 60 70 

80 90 100 110 120 130 140 

inputs KEREALRTEADTISGRVDRLEREVDYLETQNPA^ 

KEREALRTEADT I SGRVDRLEREVD YL ETQN PAL PCVE FDE KVTGG PGT KG KGRRN E KYDMVTDCG YT I S 
80 90 100 110 120 130 140 

150 160 170 180 190 200 210 

inputs QVRSMKILKRFGGPAGLWTIODPIXTOTEKIYVIJX^^ 

QVRSMKILlOtfXXJPAGLVnrKDPUKrrEia 

150 160 170 180 190 200 210 

220 230 240 250 260 270 2B0 

inputs GQLVYGGFLYFARRPPGRPGGGGEMENTLQLIKFKLAN^ 

::::::: t :::::::::::::::::::: r i — : 
GQLVYGGFLYFARRPPCRPGGGGEMENTLQLI KFH1-ANRTVVDSSVFPAEGLI PPYGLTADTYIDLAADE 
220 230 240 250 260 270 280 

290 300 310 320 330 340 350 

i nputs EGLWAVYATREOORHLCIJUCLDPQTLDTEQOWDTPCPRENAEAAFVICGTLY^ 

EGLWAVYATRi^DRHbClJOaJDPQTI^EQQW 

290 300 310 320 330 340 350 

360 370 380 390 400 

inputs ASGTLTPERAALPYFPRRYGAHASLRYNPR£RQLYAWDDGYQIVY10C#EMRKKEEErV 
: : 2 . 

ASGPX 
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ALIGN calculates a global alignment of two sequences 

version 2.0uPlease cite: Myers and Miller. CABIOS (1989) 

" v " • «• 1832 aa vs. 

% ac0214« 192S aa 

scoring matrix: pan>120.mat, gap penalties: -12/-4 
93. S% identity; Global alignment score: 9158 

f 

10 20 30 40 . 50 60 

inpu t s GTCGACCCACGCGTCC- - - GCGGACGCGTGGG - - TGAGGGG AAGACGCTGACTGTACGTTCCTTCTACTC 

CCC-CGCCTCCAAAGCTAACCCTCGGGCTTGAGGGGAAGANGCTGACTGTACGTTCCTTCTACTC 

10 20 30 40 50 60 

70 80 Q0 100 110 120 130 

i npu t s TGGCACC ACTCTCC AGGCTGCC ATCGGGCCCAGC ACCCCTCTCCTC ATCTTCTTCCTTTTGTCATCGTCC 
: : = : 2 :::::::::::::::::: : :::::::::::: 2 ::::::: 2 2 r :: : 
TGGCACCACTCTCCAGGCTGCCATGGGGCCCAGCACCCCTCTCCTCATCTTGTTCC 

7 0 80 90 100 110 120 130 

140 150 160 170 180 190 200 

i npu t s GCACCCCTCCAACG AC ACCAGCACC ACCTTGTGG AGTACATGGAACGCCGACTAGCTGCTTTAGAGGAAC 

GGACCCCTCCAAGGACAGCAGCACCACCTTGTGGAGTACATGGAACGCCGACTAGCTGCTTTAGAGGAAC 
140 150 160 170 180 190 200 

210 220 230 240 250 260 270 

i npu t s GGCTGGCCC AGTGCCAGG ACC AGAGTAGTCGGCATGCTGCTG AGCTGCGGGACTTC AAGAACAAG ATGCT 

■■■••••••■••••••••••••••••••••••a* - _ . ••••■•■••••■••••••••• 

GGCTGGCCCACrrcCCAGGACCAGAGTAGTCGGCATXKr^ 

210 220 230 240 250 260 270 

280 290 300 310 320 330 340 

inputs -GCCACTGCTGGAGGTGGCAGAQAAGGAGCGGGAGGCACTCAGAACTGAGGCCGACACCATCTCCGGGA^ 

2'2 222»222?2! * 2 2 2 2 I I ! I r t ; ! I ! • * • • * • • • ' 4 • •••••••••••••••••«•••••••••••• • 

••■••"•■••••"••••••••••••••••••••*•••••••••••••••*••• 

NGCCACTGCTGGAGGTGGCAGAGAAGGAGCGGGAGGCACTCAG 

280 290 300 310 320 330 340 

350 360 370 380 390 400 410 

inputs ACrTGGATCGTCTGGAGCGGGAGGTAGACTATCTGGAGA^ 



> * ♦ • 



••••••••••••••••••••••••■••••••••••••a*** 



AGTGGATCGTCTGGAGCGGGAGGTAGACTATCTGGAGACCCAGAA 

3S0 360 370 380 390 400 410 

*20 «0 440 450 460 470 480 

inputs GATGAGAAGGTGACTGGAGGCCCTGGGACCAAAGGCAAG 

: 2 :::::::: ': ::::: x :::::::::: s s : : :t::::t:::t::::t 
GATGAGAAGGTGACTGGAGGCCCTGGGACCAAAG 

420 430 440 450 460 470 480 

490 500 S10 520 * 530 540 550 
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inputs CAGACTGTGGCTACACAATCTCTCAAGTGAGATCAATGAAGATT^ 

izzzzzzzzzzztzzzzzzzzzzizzzzzzzzzzzzzzzzzzzzzzzizzzzzzzzzzzzzzzzttziiz^ 
CAGACTGTCCCTACACAATCTCTCAAGTGAGATCAATGAAGAT^ 

490 500 510 520 530 540 550 

560 570 580 590 600 610 620 

i npu t s TCTATGGACCAAGGATCCACTGGGGCAAACAGAGAAGATCTACGTCTTAGATGGGACACAGAATGACACA 

::::::::::::::::: z :::::::::::::::::::::::: : :::: s ::::::::::: 2 : s : ' - : : : : : 
TCTATGGACCAAGGATCCACTGGGCCAAACAGAGAAGATCTACGTGTTAGATGGGACACAGAATGACACA 
560 570 580 590 600 610 620 

630 640 650 660 670 680 690 

i npu t s GCCTTTGTCTTCCC AACGCTGCGTG ACTTC ACCCTTGCC ATCGCTGCCCGG AAACCTTCCCG AGTCCGGG 

GCCTTTGTCTTCCCAAGGCTGCGTGACTTCACCCTTGCCATGGCTGCCCGGAAAGCTTCCCGAGTCCGGG 
630 640 650 660 670 680 690 

700 710 720 730 740 750 760 

i npu t s TGCCCTTCCCCTGCGTAGGCAC ACGGCAGCTGGTAT^^ 

• ••...•»•..»•...•♦••••••-••••••••-••••-•*•**••••*• 

TGCCCTTCCCCTGGGTAGGCACAGGGCAGCTGGTATATGGTGGCTTTCTTTATTTTGCTCGGAGGCCTCC 
700 710 720 730 740 750 760 

770 780 790 800 810 820 830 

inputs TGGAAGACCTGGTGGAGGTGGTGAGATGGAGAACACTTTGCA 

TGGAAGACCTGGTGGAGGTGGTGAGATGGAGAACACITTGCAGCT 

770 780 790 800 810 820 830 

840 850 860 870 880 890 900 

inputs ACAGTGGTCXJACAGCTCAGTATTCCC^^ 

:::::::::::::::::::::::::::::::: • :::::::::::::::: : * : ' ' : : : : : : : : : 5 : : : : : : 
. ACAGTGGTGGACAGCTCAGTATTCCCAGCAGAGGGGCTGATCCCCCCCTACGGCTTGACAGCAGACACCT 

840 850 860 870 880 890 900 

910 920 930 940 950 960 970 

inputs ACATCGACCTGGCAGCTGATGAGGAAGGTCTTTGGG 

;::::::::::::::s:;::;::::::::;:;i::::::::5i"i!!i J ::j:::i:::::::5: S H1 
ACATCGACCTGGCAGCTGATGAGGAAGGTCTTTGGG 

910 920 930 940 950 960 970 

980 990 1000 1010 1020 1030 1040 

inputs GTGTCTGGCCAAGTTAGATCCACAGACACTGGACACAGAGCAGC 

;:;iJ:::::::::i::::;:::::::::::::::i:::::s:iSS!SS!i !j:5!:::::s:SSJ!:,: 
GTGTCTGGCCAAGTTAGATCCACAGACACTGGAC^ 

980 990 1000 1010 1020 1030 1040 

1050 1060 1070 1080 1090 1100 1110 

inputs AATGCTGAGGCTGCCTTTGTCATCTGTGGGAC 

:::ii::::::::::;::.:::;::::s:i"i:;:::::j;!i- ::::::j:;j:::::SIj:::!:: 
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AATGCTGAGGCTGCCTTTOTCATCW 

1050 1060 1070 1080 1090 1100 1110 - 

1120 1130 1140 1150 1160 1170 1180 

i npu t s CCCGC ATCC AGTGCTCCTTTG ATGCCAGCGGC ACCCTGACCCCTGAACGGGCAGC ACTCCCTTATTTTCC 

CCCCCATCCAGTCCTCCTTTGATGCC^^ 

1120 1130 1140 11SU 1160 1170 1180 

1190 1200 1210 1220 1230 1240 1250 

inputs CCGCAGATATGGTGCCCATGCCAGCCTCCGCTATAACCCCCGAGAACGCCAGCTCTATGCCTGGGATGAT 

222*22222222222222222222.I5"."* , ***** , * , "**** > ******** , ***"* ,#, *" , *****»* 
■*•*•""•••""*••••*.***••••*•••••••••••••••••»•••<»»••••••••••••••••••»». 

CCGCAGATATGGTGCCCATGCCAGCCTCCGCTATAACCCCCGAGAACGCCAGCTCTATGCCTGGGATGAT 
1190 1200 1210 1220 1230 1240 1250 

1260 1270 1280 1290 1300 1310 1320 

i npu t s GGCTACCAGATTGTCTATAAGCTGGAGATGAGGAAGAAAGAGGAGGAGGTTT 

*•••••"*•***••••••-•"••••••••••••«••••«••••••••••••••••••••••■••••••» 

CGCTACCAGATTGTCTATAAGCTGGAGATGAGGAAGAAAGAGGAGGAGGTTTGAGGAGCTAGCCTTGTTT 
1260 1270 1280 1290 1300 1310 1320 

1330 ' 1340 1350 1360 1370 1380 1390 

i npu t s TTTGCATCTTTCTCr.CTCCCATACATTTATATTATATCCCCACT 

"*"***•***""•••••••••••••••••«•••••••-••••••••••••••••»••••••••-•»•••• 

*•••••••••*•••***•••••♦••»••••••••••••••■»•«••■•«••••••»••••■«••••••»• 

TTTGCATCTTTCTC ACTCCC AT AC ATTTATATT ATATCCCC ACTAAATTTCTTG ATTCTTC AAA 

1330 1340 13S0 1360 1370 1380 1390 

1400 1410 1420 1430 1440 1450 1460 

inputs TGTGGGCCAGTTGTGGCTCAAATCCTCTATATTTTTAGC 

**************** * • • • » • 

TGTGGGCC AGTTGTGGCTC AAATCCTCTATATTTTT AGCTCCTTTG 
1400 1410 1420 1430 1440 1450 1460 

1470 1480 1490 1500 1510 1520 1S30 

inputs TTTCATACGGAACTCCAGATCCTGAGTAATCCTTTTAGAGCC 

•••*••*••*•*•••**■***•••••••••••••••••*••*»•*•••*«»»»••••••*•••»••••"* 

TTTCATACGGAACTCCAGATCCTGACTAATCCTTT^ 

1470 1480 1490 1500 1510 1520 1530 

1540 1550 1560 1570 1S80 1590 1600 

inputs CCTGCTCTCCTGCCCCATGT^ 

I::::::::::::::::::::;::::::::;:::::::::::::::: :::::::::::::::::::::: 
CCTGCTCTCCKXX^ 

1540 1550 1560 1570 1580 1590 1600 

1610 1620 1630 1640 1650 1660 1670 

inputs ATGCGGGCAGQCCCAGGGAGCAGGCAGCAGTGTTCTTCCCCT 

ATGCGGGCAGGCCCAGGGAGCAGGCAGCAGTGTTCTTCCCC^ 

1610 1620 1630 1640 1650 1660 1670 
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1680 1690 1700 1710 1720 1730 1740 
inputs AGGAGACCrrCCAGCTCTGTCCTCTCTTCCTCACTCCTCCCTTCAGT^ 

AGGAGACGTCCAGCTCTGTCCTCTCTTCCTCACTCCTC 

1680 1690 1700 1710 1720 1730 1740 

1750 1760 1770 1780 1790 1800 1810 
inputs ACATTGTTTTGTATTGCAACATTTTGCATTAAAAGGAA 

• •••••••*••»••■••*•••• 

AC ATTGTTTTGTATTGC AAC ATTTTGCATTAAAAGG AAAATCC AN AAAAAAAAAAAAAAAAAAAAAJ^ 

1750 1760 1770 1780 1790 1800 1810 



1820 1830 
inputs AAAAAAAAGG GCGGCCGC- 



AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAC 

1820 1830 1840 1850 1860 1870 1880 



inputs 



TCGTCTTCTCGCAGCCGTACCCITCTGTCGTCTTCTCGCAGCC 
1890 1900 1910 1920 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> mT2S7a.a. 406 aa vs. 

> Patent Protein W75120 - (untitled) 355 aa 
scoring matrix: paml20.mat, gap penalties: -12/-4 

81.8% identity; Global alignment score: 1599 

10 20 30 40 50 60 70 

inputs MGPSAPLLLLFFLSWTGPLQGQQHHLVEYMERRIA^ 

MGPSTPLLILFLLSWSGPLQGQQHHLraYMERMi^ 

10 20 30 40 50 60 70 

80 90 100 110 120 130 140 

inputs KERETLRTEADSISGRVT)RLERE\TOYLETQNPALPW 

• :i J * • : • s • . 
KEREALRTEADTISGRVDRLEREVDYLETQNPALPCVEFDEKVTGGPGTKGKGRRNEKYDMVTDCGYTIS 
80 90 100 110 120 130 140 

150 160 170 180 190 200 210 

inputs QVRSMKILKRFGGSVGLWTKDPLGPAEKIYVIiDGTQOT^ 

:::: 2 ::::::::..:::::::::..::::::::::::::::::::::::::::::::::.::::::: j : 
QVTISMKILKRFGGPAGLWTKDPLGQTEKIYVTjDGTQ 

150 160 170 180 190 200 210 

220 230 240 250 260 270 280 

inputs GQLVYGGFLYYARRPPGGPGGGGELENTLQLIKFHLANRTVVDSSVFPAESLIPPYGLTADTYIDIiAADE 

::::::::::»:::jti ****** ••••■••■••»-•••••••••••«•• ••••••»•■•••••••■•* 

GQLV^XXSFLYFARRPPGRPGGGGEMENTLQLIKFHU^R 

220 230 240 250 260 270 280 

290 300 310 320 330 340 350 

npu t S EGLWAVYATRDDDRHLCLAKLDPQTLDTEQQWDTPCPRENAEAAFV I CGTLYWYNTRPAS RARIQCS FD 

EGLWAvTATREDDRHLCLAKLDPQTLDTEQQWDTPCPRENAEAAFVICGTLYvAmiTRPASRARIQCSFD 
290 300 310 320 330 340 350 

360 370 380 390 400 

nputS ASGTILiAPERAAIjSYFPRRYGAHASLRYWPRERQLYAWDDGYQIVYKLEMKKKEEEV 

: : : 

ASGPX 
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ALTON calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller , CABIOS (1969) 

> OT257 n.a. X721 aa vs. 

> Patent nucleotide V34217 - (untitled) 1890 aa 
scoring matrix t paml20.mat, gap penalties: -12/-4 

76.2% identity; Global alignment score: 6493 

10 20 

inputs GT CO ACCCAC GCGTCC GACTTAAGG 

s • ::::::: ::::: :,:::.::: 

GAGCAGGAGAGAAGGCACCGCCCCACCCCX3CCTCCAAAGCTAACCCTCGGGCTC 

10 20 30 40 50 60 70 

30 40 50 

i npu CS CTGCCATGGGGCCCAGTGCTCCTCTGCTGCTCCT 



CTGTACGTTCCTTCTACTCTGGCACCACTCTCCAGGCTGCCATGGGGCCCAGCACCCCTCTCCTCATCTT 
80 90 100 110 120 130 140 

60 70 80 90 100 110 120 

inputs CTTCTTTTTGTCATGGACGGGACCCCTTCAGGGACAGCAGCACCACCTTGTGG AGTACATGGAACGCCGA 

GTTCCTTTTGTCATGGTCGGGACCCCTCCAAGGACAGCAGCACCACCTTGTGGAGTACATGGAACGCCGA 
150 160 170 180 190 200 210 

130 ^ 140 150 160 170 180 190 

i npu t s CTAGCTGCCTTAGAGGAACGGCTGGCCCAATGCCAGGATCAGAGTAGTCGGCATGCTGCCGAGCTTCGGG 

CTAGCTGCTTTAGAGGAACGGCTGGCCCAGTGCCAGGACCAGAGTAGTCGGCATGCTGCTGAGCTGCGGG 
220 230 240 250 260 270 280 

200 210 220 230 240 250 260 

inputs ACTTC AAAAACAAGATGTTGCCTCTCCTGGAGGTGGCAGAG AAGGAGCGGGAGACCCTCAG AACTGAAGC 

ACTTCAAGAACAAGATGCTGCCACTGCTGGAGGTGGCAGAGAAGGAGCGGGAGGCACTCAGAACTGAGGC 
290 300 310 320 330 340 350 

270 280 290 300 310 320 330 

i npu 1 6 AGACTCCATCTCAGGAAGAGTGG ACCGTCTTG AAAGGG AGGTAGACTATCTGGAGACACAGAACCCAGCT 

•••^ • * * « * » • + « • • * * 

*********** *••#••••*•■ * * ■ « * • * « • «•*•)«■*«•»**«•••****• ••#••••**••• 

CGACACCATCTCCGGGAGAGTGGATCGTCTGGAGCGGGAGGTAGACTATCTGGAGACCCAGAACCCAGCT 
360 370 380 390 400 410 420 

34 0 350 360 370 360 390 400 

Lnpu t S TTGCCCTGTGTAGAGCTGGATGAGAAGGTG ACTGG AGGTCCTGGAGCCAAAGGCAAGGGCXX3AAGAAATG 

CTGCCCTGTGTAGAGTTTGATGAGAAGGTGACTGGAGGCCCTGGG 

430 440 450 460 470 480 490 

410 420 430 440 450 460 470 

nputs AGAAATACX3ATATGGTQACGGACTGTAQCTACACAGTCGCTCA 

::»x.::i:::::i:::::. ::::::. :::;:xi:.:: t t : n t ::: t : : :::::::: 

AGAAGTACGATATOGTGACAGACTGlXXKrTACACAATCTCTCAAGTO 

S00 510 520 530 540 550 560 • 

480 490 500 510 520 530 540 

nputs GTTTGGTGGTTCAGTTGGCCTATGGACCAAGaATC 

. : x : : x x x ; m t:i 1 1 1 1 1 1 : s 1 1 1 1 1 s it s • 1 1 1 1 1 1 1 I • 1 1 x x : x x * : : : t : x : i : : 1 1 : : 
ATTTGQTtlXKXX^GCTGgTCTATGGACCA^ 

570 580 590 600 610 620 630 

550 560 570 580 590 600 610 

nputs GGCACCCAGAACOACACGGCTTTTGTCTTCCCAAGGCTO 

tt ft tttts ttttt.it tftttttttttttttttttttittttitttttttttttitttttttttt 

640 650 660 *70 680 690 700 

620 630 640 650 660 670 680 • 

iputs AAOCTTCOCQAATTCOOQTQCCCTTCCCCT 

ttttttttttt.t tffttttttttfltt»*«ltlittitt.ttttttltttt,tt 1 1 1 1 1 II t tttxx 
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AAGCTTCCaJAGTCCGGOTGCCCTTCC^ 

710 720 730 740 7S0 760 770 

690 700 710 720 730 740 750 

inputs TTATOCTCGAAGGCCTCCTGGAGGACCTGOAGGGGGTGGTO 

tt* i ::: « t t j t : ::::::_ 
TTTTGCTCGGAGGarrCCTOOAAGACCTGOTTC 

780 790 800 810 820 830 640 

760 770 780 790 800 610 820 

inputs TTTCACTTGGCAAACCGAACAGTGGTGGATAGCTCAGTGTTCCCTGCAGAGAGCCT 

TTCCACCTGGCAAACCGAACAGTGGTGGACAGCTCAGTATTCCCAGCAGAGGGGCTGATCCCCCCCTACG 
850 860 870 880 890 900 910 

830 84 0 850 860 870 ' 880 890 

i npu C s GCCTG ACAGCAGATACATATATCGACCTGGCAGCTGATGAGGAGGGCCTGTGGGCTGTCTATGCCACTCG 

GCTTGACAGCAGACACCTACATCGACCTGGCAGCTGATGAGGAAGGTCTTTGGGCTGTCTATGCCACCCG 
920 930 940 950 960 970 980 

900 910 920 930 940 950 960 

i npu t S AGATGATGACAGGCATTTGTGTCTAGCCAAGTTAGACCCACAGACACTTGACACAGAGCAGCAGTGGGAC 

GGAGGATGACAGGCACTTX3TGTCTGGCCAAGTTAGATCCACAGACACTGGACACAGAGCAGCAGTGGGAC 
990 1000 1010 1020 1030 1040 1050 

970 980 990 1000 1010 1020 1030 

inputs ACACCATGTCCCAGAGAGAACGCAGAGGCTGCGTTTGTCATCTGTGGGACCCTGTACGTTGTCTATAACA 

ACACCATGTCCCAGAGAGAATGCTGAGGCTGCCTTTGTCATCTGTGGGACCCTCTATGTCGTCTATAACA 
1060 1070 1080 1090 1100 1110 1120 

1040 1050 1060 1070 1080 1090 1100 

i npu t s CCCGCCCTGCCAGTAGGGCTCGTATTCAGTGTTCCTTCGATGCCAGTGGTACTCTCGCCCCTGAAAGGGC 

*•♦•»••** * » • * * * * • * • •* • * * * 

• • • » *•*••«•«* m m m • * • • * «»»•• • * ■ • • •*••»«•• • » • * 

CCCGTCCTGCCAGTCGGGCCCGCATCCAGTGCTCCTTTGATGCCAGCGG-ACCCTGACCCCTGAACGGGC 
1130 1140 1150 1160 1170 1180 

1110 1120 1130 1140 1150 1160 1170 

inputs AGCACTCTCCTATTTTCCACGCCGATATGGTGCCCATGCCAGCCTTOGCTATAACCCCCGTO 

«■#••*. • • • ■ i i I m » ■ ■ »••.•••••»•••§•••••••••• 

AGCACTCCCTTATTTTCCCCGCAGATATGGTCCCCATGCCAGCCTCOGCTATAACCCCCGAGAACG^ 
1190 1200 1210 1220 1230 1240 1250 

1180 1190 1200 1210 1220 1230 1240 

inputs CTGTATGCCTGGGATGATGGCTACCAGATTGTCTACAAATTGGAGATO 

:: :::::::::::::::::::::::::::::::: ::. : : : : : s : : 1 *\\ 
CTCTATGCCTGGGATGATOGCTACCAGATTCTCTATAAGCTGGAG^ 
1260 1270 1280 1290 1300 1310 1320 

1250 1260 1270 1280 1290 1300 1310 

inputs AAGCAGCTAGCCTTGTGCT CTTGATTCTTATGCCCAGACATTTATATT CCTGTGAGCTCTCC 

• it itttttiiiits x i.t .itxt : 1 1 : i : x s 5 :::::: : it • I • * *i 

GAGGAGCTAGCCTTGTTTTTTGCATCTTTCTCAC^ 
1330 1340 1350 1360 1370 1380 1390 

1320 1330 1340 1350 1360 1370 

inputs TtXZAOTTC AT CCTTCAAAAOQAAOQCCAQTQ AGGACAA 

5 * • itu until. i..iititii xtxx ::it.is s mi..:: :.x : x.sx: 
TGTTCCTCATTCTTCAAAT-GTCGGCCAGTTGTGG - - -CTCAAATCCTCTATATTTTTAGCCAATGGCAA 
1400 1410 * 1420 1430 144u 1450 1460 

1360 1390 1400 1410 1420 1430 1440 

inputs CCAAATTCTCA-AOCCXXriX^ 

tsntttt . in tn tttti tt.t, sititsttitistti.it t. t i nun. t • t it 
TCAAATTCTTTCAGCTCCTTTGTTTCATACGGAAC^ 
1470 1460 1490 . 1500 1510 1520 1S30 

1450 1460 1470 1460 1490 
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inputs CAAACAAACACCCTAAAT CTTCACTCCTGCCTTATGTCCACAAAGTT TAGTT CC 

• • •••• • **•••* •••••••••• S * w • • ••••• a a 

AGTCAAAACCCTCAATGTTCCCTCCTGCTCTCCT^ 

1540 1550 1560 1570 1580 1590 1600 

1500 1510 1520 .1530 1540 1550 1560 

i npu t S AAACTCAGAGCCCTGTCCTTTGGAG AGGGTCAACCCCAGACAGCAGGCGACAGCATTCTTGCCCTCAGTA 

• • • *••• » . . ■ ■ • ■ ■ • • • "??5SIS-^ 

• • » • a a a a a • • * a •••«•• 9 **»am»m»»99 a • • a • • •••»»•••• 

AGACCCAGGGCTCTAACCTTGTATGCGGG-CAGGCCCAGGGAGCAGGCAGCAGTGTTCTTCCCCTCAGAG 
1610 1620 1630 1640 1650 1660 1670 

1570 1580 1590 1600 1610 1620 
inputs TGACC - GAAGGG AG AG AACTCAG AG A CAAAGCTGCCCTC CCTCCCTTCCCCCTCCAGTG 

■ a a a * • . • • a a • a • a a a a a a a • a • « aaaa aa a • • • a a a a • a 

aaaa «♦••••«••••« • • • a « a • a a » a » aaaa a » • a ■ • a • a • » a a a a a 

TGACTTGGGGAGGGAGAAATAGGAGGAGACGTCCAGCTCTGTCCTCTCTTCCTCACTCCTCCCTTCAGTG 
1680 1690 1700 1710 1720 1730 1740 

1630 1640 1650 1660 16?0 1680 1690 

i npu t s TAGGGG AGAATGGGGCTTTCCCCACATCACITTGTATGGTAACAGTTTGC ATTAAAAGGAAAACCCAC - - 

• a a a a aa »»•*• 9 • 9 • w * • • » 9 * J ! ■ J 
a a. aaaa »•••••-•• a a a • a a a • a • a • a • a aaaa aaaa 

TCCTGAGGAACAGGACTTTCTCCACATTGTTTTGTATTGCAACATTTTGCATTAAAAGGA 

1750 1760 1770 1780 1790 1800 1810 

1700 1710 1720 
inputs CAAAAAAAAAAAAAAAGGG CGGC CG 

*»*999mm9m»9m»»9 aaaa J # 

• aaaaaaaaaaaa*..... a aaa 

CAAAAAAAAAAAAAAAAAAAAAAAaAAAAAAAAAACGGCACGAGGGGGGGTC 

1820 1830 1840 1850 1860 1870 1880 



inputs C- 



CATGCAT 
1890 
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GTCGACCCACGCGTNCin'CCAGCOTNCGGAGCCGCCCTGGGTGTCAGCGGCTCGGCTCCCGCGCACGCTCCGQCCGTCG 



79 



Mi 

CGCAGCCTCGGCACCTGCAGGrCCGTGCGTCCCGCGGCTGGCGCCCCTGACTCCGTCCCGGCCAGGGAGGGCC ATG l SS 

1SLPG PLV TN LXR FLFLGLS 21 

ATT TCC CTC CCG GGG CCC CTG GTG ACC AAC TTG NTG CGG TTT TTG TTC CTG GGG CTG AGT 21 S 

ALAPp SRAQLQLHLPANR LQ 4i 

GCC CTC GCG CCC CCC TCG CGG GCC CAG CTG CAA CTG CAC TTG CCC GCC AAC CGG TTG CAG 275 

A V E E G E S GASAWYTLH REVS 61 

GCG GTG GAG GAG GGG GAA AGT GGT GCT TCA GCA TGG TAC ACC TTG CAC AGG GAG GTG TCT 33 S 

SS Q pw E VPFVMWFF KQKE KE 81 

TCA TCC CAG CCA TGG GAG GTG CCC TTT GTG ATG TGG TTC TTC AAA CAG AAA GAA AAG GAG 395 

D Q V L S Y I NGVTTSKPGVSLV 101 

GAT CAG GTG . TTG TCC TAC ATC AAT GGG GTC ACA ACA AGC AAA CCT GGA GTA TCC TTG GTC 455 

YSMp SRN. LSLRVEGLQEKD S 121 

TAC TCC ATG CCC TCC CGG AAC CTG TCC CTG CGG GTG GAG GGT CTC CAG GAG AAA GAC TCT SIS 

GPYSCSVNVQDKQGKSRG HS 141 

GGC CCC TAC AGC TGC TCC GTG AAT GTG CAA GAC AAA CAA GGC AAA TCT AGG GGC CAC AGC 575 

IKTLELNVLVPPAPPSCRL Q 161 

ATC AAA ACC TTA GAA CTC AAT GTA CTG GTT CCT CCA GCT CCT CCA TCC TGC CGT CTC CAG 63S 

GVPHVGANVT LSCQSPRSKP 181 

GGT GTG CCC CAT GTG GGG GCA AAC GTG ACC CTG AGC TGC CAG TCT CCA AGG AGT AAG CCC 695 

AV Q YQ W D RQL PS FQ TFFA P A 201 

GCT GTC CAA TAC CAG TGG GAT CGG CAG CTT CCA TCC TTC CAG ACT TTC TTT GCA CCA GCA 75S 

LDVIRGS I» S L T N L S SSMAG V 221 

TTA GAT GTC ATC CGT GGG TCT TTA AGC CTC ACC AAC CTT TCG TCT TCC ATG GCT GGA GTC 81S 

Y V C KA H N EVGTAQ C N V T I* E V 241 

TAT GTC TGC AAG GCC CAC AAT GAG GTG GGC ACT GCC CAA TOT AAT GTG ACQ CTG GAA GTG 87S 

STOp GAAVVA EAVVGTI» VG X. 26X 

AGC ACA GGG CCT GGA GCT GCA GTG GTT GCT GAA GCT GTT GTG GGT ACC CTG GTT GGA CTG 935 

O L L A G It V It L Y B R R G KALE E P 281 

GGG TTG CTG GCT GGG CTG GTC CTC TTG TAC CAC CGC CGG GGC AAG GCC CTG GAG GAG CCA 99S 

ANDI KEDAIAPRTLP WPKS S:. 301 
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GCC AAT GAT ATC AAG GAG GAT GCC ATT GCT CCC CGG ACC CTG CCC TGG CCC AAG AGC TCA 1055 

D T I SKNGT .LSSVTSARALR — P 321 

GAC ACA ATC TCC AAG AAT GGG ACC CTT TCC TCT GTC ACC TCC GCA CGA GCC CTC CGG CCA 1115 

PHGPP RPGALTPTPSLSSQA 341 

CCC CAT GGC CCT CCC AGG CCT GGT GCA TTG ACC CCC ACG CCC AGT CTA TCC AGC CAG GCC 1175 

LPSPRHAHDRWGPPSTNI PH 361 

CTG CCC TCA CCA AG A CAT GCC CAC GAC AGA TGG GGC CCA CCC TCA ACC AAT ATC CCC CAT 1235 

PWWGFFLW L* 371 
CCC TGG TGG GGT TTT TTC CTT TGG CTT TGA 1265 

GCCGGftTGGGTGCTGNGCCTGTGATGGNGCCTGCCCAGAGTCAAGC^ 1344 

G^CTAAAGGATTTGGGGTCTCTCCTTCCTATAAGGGTCACCTCT 14 23 

TCCTGACCCITAGTACrCTGCCCCCACCrCTCTITACT 1502 

AGAAGGAGAAGAGGAAGTGGATCTGGAATTGGGAGGAGCCTCCACCCACCCCT 1581 

3AAATTAGCTACTCACCAAGAGTGAGGGGCAG AGACTTCC^GTCACTGAGTCTCCC^GGCCCCCITGATCTGTACCC(^ 166 0 

-CCCTATCTAACACCACCCTTGGCTCCCACTCCAGCTCCCTGTATTC 173 9 

rACTGGGGCAGAGGATAGGGAATCTCrTATTAAAACTAACATGAAATATGTGTTG 1818 
GATACATAATGTTTGTATGAGATAAGAAAAAAAAAAAAAAAGGGCGGCCGC 1869 
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MISLPGPLVTNLXRFLFLGLSAIAPPSRAQI^LHLPANRLQAVEBGESGASAWYTLH 

SSSQPWBVPFVMWFFKQKEKEDQVLSYINGVTTSKPGVSLVYSMPSRNLSLRVEGLQEKD 

SGPYSCSVNVQDKOGKSRGHSIKTI^LNVIiVPPAPPSCRLQGVPHVGANVTLSCQSPRSK 

PAVQYQWDRQLPSFQTFFAPALD^RGSLSLTNLSSSMAGVWCKAHNEVGTAQCNVTLE 

VSTGPGAAWAEAWGTLVGLGLLAGLVLLYHRRGKALEEPANDIKEDAIAPRTLPWPKS 

SDTISKNGTLSSVTSARALRPPHGPPRPGALTPTPSLSSQALPSPRHAHDRWGPPSTNIP 

HPWWGFFLWL 
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GTCGACCCACGCGTCCXSGTGCACATT^ 7 9 

MI LQAGTPETSLL 13 

CCTGAGTACTCCGGGCCAAGGAGGGCC ATG ATT CTT CAG GOT GGA ACC CCC GAG ACC AGC TTG CTG 145 

RV h F LGL S T L AA F S R AQ M E L 33 

CGG GTT TTG TTC CTG GGA CTG AGT ACC CTT GCT GCC TTC TCC CGA GCT CAG ATG GAG TTG 205 

HVP PGLN KL E A V E G E EVV L P 53 

CAC GTG CCC CCG GGC CTC AAC AAA TTG GAA GCG GTA GAG GGA GAA GAA GTG GTG CTC CCC 265 

AWYTMAREESWSH PREVP I L 73 

GCC TGG TAC ACG ATG GCA CGG GAG GAG TCG TGG TCC CAC CCC CGG GAG GTG CCC ATC CTG 325 

I W F L E Q E G K E P N Q V L S Y I N G 93 

ATC TGG TTC TTG GAA CAA GAA GGG AAG GAA CCA AAC CAG GTG TTG TCT TAC ATT AAT GGA 385 

VMTN KPGTALVHS ISSRNVS 113 

GTC ATG ACA AAT AAA CCT GGA ACA GCC CTG GTC CAC TCT ATC TCT TCA CGG AAT GTG TCC 44 5 

LRLGA-LQEGD'. SGTYRCSVNV 133 

CTG CGC CTG GGG GCA CTC CAG GAG GGA GAC TCT GGG ACT TAC CGC TGT TCT GTC AAT GTG 505 

Q HDEGKS IG H S I K S I ELKV L 153 

CAG AAT GAT GAA GGC AAA AGT ATA GGC CAC AGC ATC AAA AGC ATA GAG CTC AAA GTG CTG 565 

VPPAPPSCS LQGVPYVGTNV 173 

GTT CCT CCA GCT CCT CCA TCC TGT AGT TTA CAG GGT GTA CCC TAT GTC GGG ACC AAT GTG 625 

TLNCKSPRS K PTA Q Y QWE R 1* 193 

ACC CTG AAC TGC AAG TCC CCA AGG AGT AAA CCT ACT GCT CAG TAC CAG TGG GAG AGG CTG 685 

APSSQVFFG PA LDAVRGS L K 213 

GCC CCA TCC TCC CAG GTC TTC TTT GGA CCA GCC TTA GAT GCT GTT CGT GGA TCT TTA AAG 745 

LTNLS IAMS GVYVC KAQNRV 233 

CTC ACT AAC CTT TCC ATT GCC ATG TCT GGA GTC TAT GTC TGC AAG GCT CAA AAC AGA GTG 805 

G F A K C NV T L D VM T G S K AA V V 253 

3GC TTT GCC AAG TGC AAC GTG ACC TTG GAC GTG ATG ACA GGG TCC AAG GCT GCA GTG GTC 865 

AGAVVGTFVGI*VI*IAGI#VLL 273 

3CT GGA GCA GTT GTG GGC ACT TTT GTT GGG TTG GTG CTG ATA GCT GGG CTG GTC CTG TTG 925 

YQRRSKTLEEIiANDIKE DAl 293 

CAC CAG CGC CGG AGC AAG ACC TTG GAA GAG CTG GCC AAT GAT ATC AAG GAA GAT GCC ATT 985 

APRT LPWTK G SD T I S K N G T h 313 

5CT CCC CGG ACC TTG CCT TGG ACC AAA GGC TCA GAC ACA ATC TCC AAG AAT GGG ACA CTT 1045 
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SS VTSARAL R PP KAAPPRPG 333 

TCT TCG GTC ACC TCA GCA CGA GCT CTG CGG CCA CCC AAG GCT GCT CCT CCA AGA CCT QGC x 105 

TFT PTPSVSS Q A L SSPRLPR 3 S3 

ACA TTT ACT CCC ACA CCC AGT GTC TCT AGC CAG GCC CTG TCC TCA CCA AGA CTG CCC AGG 1165 

V *>EPPPQAVSLTPGGVSSSA 373 

GTA GAT GAA CCC CCA CCT CAG GCA GTG TCC CTG ACC CCA GGT GGG GTT TCT TCT TCT GCT 1225 

LSRMGAV PVMVPAQSQAGSL 393 

CTG AGC CGC ATG GGT GCT GTG CCT GTG ATG GTG CCT GCA CAG AGT CAG GCT GGG TCT CTT 128 5 

V 395 

GTG TGA 1291 

TAGCCCAGGCACTCATTAGCTACATCTGGTATCTGACCTTTCTGTAAAGG 1370 

3GGAGGATGCCCACATTCTAGACCTCCAGTCCTTTGCTCCTACCT 144 9 

rTAAAATCTGGGTCAAAGGACAAAAGG AGG AAATGG ACCTGAGGTAGGGGGTTGGGAGTGAGGAGGCTTCACTTCCTCC 1528 

rTGCTTCTCCCTGAAGCCAGATGAATGCTGCGGAAGATCGGCTACCCTCCAAGGGCTCTGGAGGAGA 1607 

JATGCCCCTC^CTCTGTG ATCrcTACAACACCCTTATCTAATGCTO 1686 

ATAACCTGTCCTG CTGGCTTGG CTGGG TTTTG TTGT AG CAG GGGG AT AGG AAAGACATTTTAAAATCTGACTTGAAAT 1765 

3ATGTTTTTGTTTTTATTTTGCAAAT1^ 1844 

1846 
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>iuT258 

MII^AGTPETSLLRVLFLGLSTLAAFSRAQ^ 

EESWSHPREVPILIWFLEQEGKEPNQVLSYINGVMTNKPGTALVHSISSRNVSLRLGALQ 
EGDSGTYRC SVNVQNDEG KS IGHS IKS IELKVLVPPAPPSCSLQGVP YVGTNVTLNC KS P 
RSKPTAQYQWERLAPSSQ VFFG PALDAVRGS LKLTNLS I AMSGVYVCKAQNRVGFAKCNV 
TLDVMTGSKAAWAGAVVGTFVGLVLIAGLVLLYQRRS KTLEELANDI KEDAI APRTLPW 
TKGSDTISKNGTLSSVTSARALRPPKAAPPRPGTFTPTPSVSSQALSSPRLPRVDEPPPQ 

AVSLTPGGVSSSAIiSRMGAVPVMVPAQSQAGSLV 
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ALIGN calculate* a global alignment of two sequences 
version 2.0uPlease citei Kyers and Killer, CABIOS (1989) 

> hT2S8a.a. 370 aa vs. 

> raT2SB a. a. 394 aa 

scoring matrix: paml20.mat, gap penalties: -12/-4 

€2.6% identity; Global alignment score: 108S 

10 20 30 40 SO €0 

inputs MISLPGPLVTMIJCRFLri/jLSALAPPSRAQLQLHLPA- -NRLQAVEEGESGASAVnTTLHREVSSSQPWEV 
• • . « • . ...... .* . ■ . . . » . • « ... * • • • . _ * ; » s « s : 

MILOACTPETSlXRVLFLGLSTIJLAFSRA0MEUm>P^ 

10 20 30 40 50 £0 70 

70 80 90 100 110 120 130 

inputs PFVMWFFKQKEKE-DQVI^YINGVTTSKPG^ 

PILIWFLEQEGKEPNQVLSYINGVhrTMKP^ 

80 90 100 110 120 130 140 

140 ISO 160 170 180 190 200 

inputs RG HS I KTL ELNVLVP PAP PS CRLQGVPKVG ANVTLS CQS P RS KP AVQ YQWDRQL P S FQT F F AP ALD V I RG 

• *•••«•*••••••••«••* • • m • ■ *>*•••«>•••••**•>»•>•••«•>•• • • • ••**»••••■••■ 

IGH5 I KS I EIJCVLVPPAP PSCS LQGVPYVGTNVTLNCKS PRS KPTAQYQWERJLAPS SQVFFG PALDAVRG 
150 160 170 180 190 200 210 

210 220 230 240 2S0 260 270 

inputs S LS LTKLS S SMAGVYVCKAHNEVGT AQCNVT L EVSTG PGAAWAEA WGTL VG LG LtAG LVLL YKRRG KA 

: t : r : : : .$.:::::::.; : : :.:::'::.: : : . : : : : : :::::.::: 

S LKLTKLS I AMSGVYVCKAQNRVG FAXCNVTLDVMTGS KAAWAGAWGTFVGLVL I AG LVLLYQRRS KT 
220 230 240 250 260 270 280 

260 290 300 310 320 330 340 

inputs LEEPANDIKEDAIAPRTLPWPKSSDTISKKGTLSSOT 

:t: :::**:::::.::::. :.:::::::::::::::::::::: . :::«:..:::::.:::::.:: t 
LEELANDIKEDAIAPRTLPWTKGSDTISKHGTl^^ 

29C 300 310 320 330 340 3S0 

3S0 360 370 

inputs HAH DRWGPPSTNI PHPWWGFFLWL 

.::.••**..:* 
LPRVDEPPPOAVSLTPGGVSSSALSRMGAVPVMVPAQSQAGSL-V 
360 370 380 390 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> hT258a.a. 370 aa vs. 

> SwiesProt Q99795 - (untitled) 319 aa 
scoring matrix: paml20.mat, gap penalties: -12/-4 „ , 
23.0% identity; Global alignment score: -102 

10 20 30 40 50 60 

inputs MISLPGPLVTNLXRFLFU5LSM.APPSRAQLQLHLPANRLQAVEEG-ESGASAWYTLHREVSSSQPWEVP 

: • • : • . . : : : 

MVGKMWPVLWTLCA - VT^\n?VBAI S VETPQDV- LRASQGKS VTLPCTYHTSTisSREGLIOWDKLLLTHTER 
10 20 30 40 SO 60 

70 80 90 100 110 120 130 

inputs FVMWFFKQKEKEDQVLSYINGVTTSKPGVSLVYSMPSRNLSLRVEGLQEKDSGPYSCSVNVQDKQGKSRG 

• • * • - • • « . , ...... . 

• ••«••• .... . ... . .:•••: : * 

WIWPFSNKN YIHG - ELYKNRVSISNNAEQSDASITIDQLTMADNGTYECSVSLMSDLE G 

70 80 90 100 110 120 

140 150 160 170 180 190 200 

inputs HSIKTLELNVLVPPAPPSCRLQGVPHVGANVTLSCQSPRSKPAVQYQWDR--QLPSFQTFFAPALDVIRG 
• • . * » : s » i » * * * • . . . * . • . ■ . ■ . 

» • **• <•«••• ..... . , . . « . . . . a .. . 

NTKSRVRLLVLVPPSKPECG I EGETI IGNNIQLTCQS KEGS PTPQYS WKRYNILNQEQPLAQPASGQ 

130 140 150 160 170 180 190 

210 220 230 240 250 260 270 

. np u t s S LS LTNLS S S MAG VYVCKAHKE VGTAQCNVTLE VS TG P - G AAVVAE A VVGTLVG LGLLAG LVLL YHRRG K 

PVSLKNISTDTSGYYICTSSNEEGTQFCNITVAN^SPSMKVAIiYVGIAVGVVAALIIIGIIIYCCCCRGK 
200 210 220 230 24 0 250 260 

280 290 300 310 320 330 340 

nputS ALEEPANDIKEDAIAPRTLPWPKSSDTISKNGTLSSVTSARALRPPHGPPRPGALTPTPSLSSQALPSPR 

»•••::::: • ■ • ... • 

• .... ... > 

- - DDNTED - KEDA - - RPNREAYEEP - PEQLRELSREREEE - DDYR 

270 280 290 300 

350 360 370 

iputs HAHDRWGPPSTNI PHPWWGFFLWL 

QEEQR - - STGRES PDH LDQ 

310 
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ALIGN calculates a global alignment of two seouences 
version 2.0uPlease cite: Myers and Miller, CABIOS U989) 

> hT2S0 n.a. lfi6 9 aa vfi< 

> GenBank U79725 - Human A33 antigen precursor mR 2793 aa 
scoring matrix: pamUO.mat, gap penalties: -12/-4 

40.$% identity; Global alignment score: 1182 

10 20 
inputs GTCGACCC--- ACGCGTNCNT CCAG 



- -CTACCCCTTTGTGAGCAGTCTAGGAGTTTGTACACCTGTO 

10 20 30 40 50 60 

30 40 50 

inputs GTNC GG AG - CCGC CCT GGGTGTCA - GCG - GC 

GGTTTAAGGGGAACTTGAGGGAAGTAGGGAAGACTCCTCTTGGGACCTTTGGAGTAGGTGACACATGAGC 
70 60 90 100 110 120 130 

60 70 80 90 
i npu t S TCGGCTCCCGCGC AC - - GC TCCGGCCGT CGCGC - AGCCT CGGCA C C 



CCAGCCCCAGCTCACCTGCCAATCCAGCraAGGAGCTCACCTGCCAATCCAGCTG 
140 150 160 170 180 190 200 

100 110 120 
inputs. TGC AGG TCC GTGC - - GTCCCG - - - CGGCTGGCGCC CCTG 



GGGTGAGAAGAGGGA^AATTGCAGGGACCTCCAGTTGGGCCAGGCCAGAAGCTGCTGTAGCTTTAACCAG 
210 220 230 240 250 260 270 

130 140 150 
i npu t S AC--- TCCGTCC CGGCCAGGGA GGGC - - CATGA 

ACAGCTCAGACCTGTCTGGAGGCTGCCAGTGACAGGTTAGGTTTAGGGCAGAGAAGAAGCAAGACCATGG 
280 290 300 310 320 330 340 

160 170 180 
inputs TTT - CC CTCCCGGGGCC- -CCTGGTGACCAAC TTGN 

TGGGGAAGATGTGGCCTGTGTTGTGGACACTCTGTGCAGTC 
350 360 370 360 390 400 410 

190 200 210 220 
inputs TGC GGTTTTTGTTC CTGGGGCTGAGTG CCCT-C GCGCC- -CC 

• • • • * • • * * • • ****** * » * I * ***** * * 

AACTCCGOVGGACGTTCrTCGGGCTTCGCAGGGA^ 
420 430 440 450 460 470 480 

230 240 2S0 260 

inputs -CCTC GCGGGCC CA GCTGCAACT-GCACTTGC CCGCC 

: ::: : :: . tist.st.i.t.s.t : i 

ACCTCCAGTCX3AGAGGGACTTATTCAATGGGATAAGCTCCTCCT 
490 500 510 520 530 540 550 

270 280 290 300 310 320 

inputs AACCGGTTGCAGGCGQTGG-- AGGAGGG - - -GGAAAGTGGTGCTTCAGCATGGTACACCTTGC 

* * t x \ 1 lt 1 s * * * .ss.:.s ••in*. x : nun:. st • t:: 

GGCCGTTTTCAAACAAAAACTACATCXATGGTGAG<nTrATA 
S60 S70 560 590 600 610 620 

330 340 356 360 

inputs A CAGGGAGG IX3TCTTCATC - CCA -GCCATOGGAGG TGC-CCTT- -TGTGATGT 

*** »•** si ssst lis .lllttl .1. ti sit. .til. I t 

TOAOCAOTOOOATQCCTCCA^ 
€30 640 650 660 670 660 690 

370 380 390 400 410 
Inputs GGTTCT TCAAAC — AGAAAGAAAAGGAGGATCAGGTGT •-TOTCCT 

' »* Slt.lt . II..I IK , •••III III I lltlt 
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OTCTXlXlLnUATCm^ACCTCre 
7 00 7X0 720 730 740 750 760 

420 430 440 

inputs ACATCAA TGGGGTCA - CAACAAGCAAACCTG - -GAGTATC 

• ♦ : 8 • 

CCAAACCAGAATGCGGCATCQAGGGAGAGACCATAATTGGG 
770 780 790 800 810 820 • 830 

450 460 470 480 
inputs CTTGGTCTACTCCATGCCCTC CCGGAA CCTGTC 

GGAGGGCT - CACCAACCCCTCAGTACAGCTGG AAG AGGTACAACATCCTG AATCAGGAGCAGCCCCTGGC 
840 850 860 870 880 890 900 

490 500 510 520 
inputs CCTGC - - - GGGT - GG AG G GTCTCC - AGG AG AAAG ACTC - - - TGGCCC - - CT ACAGCTG - 

CCAGCCAGCCTCAGGTCAGCCTGTCTCCCTGAAGAATATCTCCACAGACACATCGGGTTACTACATCTGT 
910 920 930 940 950 960 970 

530 540 550 560 S70- 
inputs - - CTCCGTG AATGTGCAAGACAAACAAG - - - GCAA * ATCTAGG - GGCCA - CAG CAT CA 

ACCTCCAGCAATGAGGAGGGGACGCAGTTCTGCAACATCACGGTGGCCGTCAGATCTCCCTCCATGAACG 
980 990 1000 1010 1020 1030 1040 

580 590 600 610 620 
inputs AAACC TTAG AACTCAATGTAC - TGGTT CCTC CA - - -GCTCCTCCATC- -CTG 

TGGCCCTGTATGTGGGCATCGCGGTGGGCGTGGTTGCAGCCCTCATTATCATTGGCATCATCATCTACTG 
1050 1060 1070 1080 1090 1100 1110 

630 640 650 660 670 
inputs C- - CGTCT - CC AGGGTGTG - -C CCCATGTG GGGGCAAACGTGACC CTGAGCTGCCAGT 

CTGCTGCTCCCGAGGGAAGGACGACAACACTGAAGACAAGGAGGATGCAAGGCCGAACCGGGAAGC 
1120 1130 1140 1150 1160 1170 1180 

680 690 700 710 
Inputs - - CTCCAAGG - AG - TAAGCCCGCTGTCCAA TAC- - -CAGTGGG 

GAGGAGCCACCAGAGCAGCTAAGAGAACTTTCCAGAGAGAGGGAGGAGGAGGATGACTACAGGCAAGAAG 
1190 1200 1210 1220 1230 1240 1250 

720 730 740 750 760 

nput s ATCGG CA GCTTCCATCC - - -TTCCAG-ACTTTCTTTG-CA- - CCAGCATTAG ATGTCATCC 

' ■ * * 9 W • • * ■ * * * • * * « • « 9 ****** • * 

AGCAGAGGAGCACTGGGCGTGAATCCCCGGACCACXn<X3ACCAGTGAC^ 
1260 1270 1280 1290 1300 • 1310 1320 

770 780 790 800 810 820 

nputs GTGGG- TCTTTAAGCCTCA - CCAA CCTTTCGTCTTCCATGGCTGGAGTCTA-TGT 

* • * » • :;.if • ti x. xx.. ix ::: sit * ::: ::: 

GAXX1AAGGGTTAGGGGTTCATTCTXXXGCTTCCTC 
1330 1340 1350 1360 1370 1380 1390 

830 840 650 860 670 

n P uts CTOCAAQGCCCA- -CAATGAGOTGGGCA- - -CTGCCC-AATGTAA- -TOTG AC---GCTGQ 

* • JSt: :i.tis x x x • x x x x :n x • x x x : x x x x :;• x x x x i 

CCCltXATCCCAOACATTOATOGGGACATTX^ 
.1400 . 1410 1420 1430 1440 * 1450 1460 

.880 690 900 910 920 930 
nputs -AAQTOAO-CAGAOOSCC^ GOTTO-OACT 

lit t.X X I.I XX IX.. t}ll.l. X .$11. tlXX mmt . . lilt XX XX XXI. 

•-CTGTGAAGTAACCCCTCCTGGCTGTOACA 
1470 1480 1490' .1500 1510 1520 1S30 

940 950 960 970 '960 
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inputs COOQTTOCTQQCTQOCC TGOTOCT- -CTTOTACCACCOC COO OOCAAO-OC- 

s in tit ttt st ..tt.tt.t tt tttt.t ti 

CCTOOTOCOOOCClXKKrCCTCACTCAAOACCAC^^ 
1S40 1550 1560 1S70 1580 1590 1600 

990 1000 1010 1020 
inputs CCTGGAGG- - -AGC-CAG . CCAAT- -GATATC AAGOAOG - -ATGCCAT 

CCTGGAAGCACAGCGCTGAGCATGGGGCGCTCCCACTCAGAACTCTCCAGGGAGGCGATGCCAGCCTTC 
1610 1620 1630 1640 1650 1660 1670 

1030 1040 1050 
inputs TGCTCCCCGG A - - -CCCTGC-CCTGG C - CCAAGAG - -CTCAG 

GGGGTGGGGGCTGTCCTGCTCACCTGTGTGCCCAGCACCTGGAGGGGCACCAGGTGGAGGGTTTGCACTC 
1680 1690 1700 1710 1720 1730 1740 

1060 1070 1060 1090 
inputs - AC AC AATCTCCAAG AATG - - GGACCCT TT CCTCTGT 

CACACATCTTTCTTGAATGAATGAAAGAATAAGTGAGTATGCTTGGGCCCTGCATTGGCCTGGCCTCCAG 
1750 1760 1770 1780 1790 1800 1810 

1100 1110 1120 
i npu t s CACC TCCGCACG AGCC CT- CCGG CCA - - CCC - C ATGGCC - - C 



CTCCCACTCCCTTTCCAACCTCACTTCCCGTAGCTGCCAGTATGTTCCAAACCCTCCTGGGAAGGCCACC 
1820 1830 1840 1850 1860 1870 1880 

1130 1140 1150 1160 1170 
i npu t s TCCCAGGCCTGGTGCATTGACCC CC ACGCCCAGTCTATC - CAGCC AGGC - 



TCCCACTCCTGCTGCACAGGCCCTGGGGAGCTTTTGCCCACACACTTTCCATCTCTGCCTGTCAATATCG 
1890 1900 1910 1920 1930 1940 1950 

1180 1190 1200 1210 1220 1230 

inputs - - CCTGCCCTCACCAAGACATGCCCACGACAG ATGGGGf - - CCC - - ACCCTCAACCAATATCCCCCATCCC 

TACCTGTCC - CTCCAGGCCCATCTCAAATCACAAGGATTTCTCTAACCCTATCCTAATTGTCCACATACG 
1960 1970 1980 1990 2000 2010 2020 

1240 1250 1260 1270 1280 
inputs TGGTGG GGTTT- - -TTTCCTTTGGCTT TGAGCCGCATGG GT- -GCTGNGC 



TGGAAACAATCCTGTTACTCTGTCCCACGTCCAATCATGGGCCACAAGGCACAGTCTTCTGAGTO 
2030 2040 2050 2060 2070 2080 2090 

1290 1300 1310 1320 1330 

Inputs CTGTG ATGGNGC - -CTGC-CCA-OAGTCAAG - -CTGGCTCTC -TGO - -TATGATGACCC C 

s : t . t s ::. s . : : t • i :::: ::: ::: : . : : : • : : : : 
TCTCACTGTATTAGAGCX3CCAGCTCXTTGGGGCAGGGCCTGGGC 

2100 2110 2120 2130 2140 2150 2160 

1340 1350 1360 1370 1380 

inputs AC--- CACTCAT TGG- --CTAAAG- - G ATTTGGGGTCTCTCCTTCCT ATAAGGGT 

i :.i ss: m ::.stx t.nn::. .:t ::: tt 

CCTAQTAQCTQQCQCCCATCCTAGTOQQCACCTAAGCCTAATTQ 

2170 2180 2190 2200 2210 2220 2230 

1390 1400 1410 1420 1430 

inputs --CAC-- CTCTAO-CAC AOA-GOCCnXSAGTCATGGOAAAOAGTCACACTCCTGAC^ TTAG 

t : ttif.i tm't ttt t. t tt.tt .tt.t.tt tt its tt.»tttt tt • 

TTCccrrcTxrrooTCTCci^^ 

2240 2250 2260 2270 2280 2290 2300 

1440 1450 1460 1470 1480 1490. 1500 

inputs TACTCI OO COOCAOCTCXCTT P ^^ 

i*s. tt t t»t ,tt. t t t tt « s.tt t t. tt .i.t.tt 

TTCAQTQAOTTAAACAOQAATTaATTTAAAQTQAACA 

2310 2320 2330 . 2340 2350 2360 * 2370 
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15X0 1520 1530 1540 1550 1560 

i npu ts AG G AG AAG AGG AAGT GG ATCTGG AATTGGG AGGAGCCTCCACCCACCCCTG ACTCCTC 

• s • • • 

GTTCTTGTGTCCTGGTAATTCCTCTCCAGGCCAGAATAATTGGCATGTCTCCTCAA 

2380 2390 .2400 2410 2420 2430 2440 

1570 1580 1590 1600 1610 1620 
inputs CTTATGAAGCCAGCTGCTGAAATTAGCTACTCA- -CCAAG AG TGAGGGGCA - GAG ACTTC 

*••«••••••*•• •••• " * • " " " • • t • * * • 

CTGGTTGTTCCTGCATCCCGATACCTCAGCCCTGGCCCTGCCCAGCCCATTTGGGCTCTGGTTTTCTGGT 
2450 2460 2470 2480 2490 2500 2510 

1630 1640 1650 1660 1670 

i npu t s C AG TCACTG AGTC - - TCCCA - GGCCCCCTT GATCTGTACCCCACCCCTATCTAACAC 

GGGGCTGTC-CTGCTGCCCTCCCACAGCCTCCTTCTGTTTGTCGAGCATTTCTTCTACTCTTGAGAGCTC 
2520 2530 2540 2550 2560 2570 2580 

1680 1690 1700 1710 1720 1730 

inputs C ACCCTT - - GGCTCCC A CTCCAGCTCCCTGTATTG ATATAACCTGTCAG - - GCTGGCTTGGTT 

• • • • • • • • • • ••• • • • «•••••• • • • 

*• * » • •••« • ■ ■ « • a a • a a • ■ • « ■ • • • • • • ■ a • • • • • ■ • 

AGGCAGCGTTAGGGCTGCTTAGGTCTCATGGACCAGTGGCTGGTCTCACCCAACTCCAGTTTACT 

2590 2600 2610 2620 2630 2640 2650 

1740 1750 1760 1770 1780 1790 

i npu t S AGGTTTTACTGGG - GCAG AGG ATAGGGAATC - TCTTATTAAAACTAAC - ATGAAATATGTGTTGT 

• •••»•• • • * ■ ■ a a • a * a * » « • a . a • a a a a • 

•••m*«9»» • • a * a • a • a a a • a • a • a • • a a aa.aaaaaaaaaa 

TATCTTTTCTGGATGATCAGAAAAATAATTCCATAA^ 

2660 2670 2680 2690 2700 2710 2720 

1800 1810 1820 1830 1840 1850 1860 

. npu t s TTTC ATTTGC AAATTTAAATAAAG ATACATAATGTTTGTATGAGATAAGAAAAAAAAAAAAAAAGGG 

•■• • • • • • aa,a a m • « • a a* Z a • 8 a S • a m»rmm»*******»**»* m * 

ATATTTTTATATATATTGTTAAATCCTTTGCTTCAT - TCCAAATGCTTTC AGTAATAATAAAATTGTGGG 
2730 2740 2750 2760 2770 2780 2790 



nputs CGGCCGC 
TGG 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> mT258 a. a. 394 aa vs. 

> SwissProt Q9979S - (untitled) 319 aa 
scoring matrix: paml20.mat, gap penalties: -12/-4 

23.0% identity; Global alignment score: -149 

10 20 30 40 50 60 70 

inputs MILQAGTPETSLLRVLFLGLSTLAAFSRAQMELHVPPGL 

• « ■ ■* : : : . ••: • : : • : :»:: : • • . 

MV -GKMWPVLW TLCAVRVTVDAISVETPQDVLRASQGKSVTLPCTyHTSTSSREGLIQWD 

10 20 30 40 50 

80 90 100 110 120 130 140 

inputs PILIWFLEQEGK£PNQ\^SYINGVMTNKPGTALV^ 



KLliLTHTERWIWPFSNKNYIHGELYKNR-VSlSNNAEQSDASITIDQLTMADNGTyECSVSLMSDLE-- 
60 70 80 90 100 110 120 

150 160 ;70 180 190 200 210 

inputs IGKSIKSIELK^^ J VPPAPPSCSLO^VPWGT^A^^LNCICSPRSKPTAQYQWERrAPSSQVFFGPALDAVRG 



-GNTKSRVRLLVLVPPSKPECGIEGETIIGNNIQIiTCQSKEGSPTPQYSWKRYNILNQE--QPLAQPASG 
130 140 150 160 170 180 190 

220 230 240 250 260 270 

inputs - SLKLTNLS I AMSGVYVCKAQNRVGFAKCNVTLDVMTGS - KAAWAGAWGTFVGLVLI AGLVLLYQRRS 



QPVSLKNISTDTSGYYICTSSNEEGTQFCNITVAVRSPSMNVALYVGIAVGWAAIil I IG - - 1 1 1 YCCCC 
200 210 220 230 240 250 260 

280 290 300 310 320 330 340 

nputs KTLEELANDIKEDAIAPRTLPWTKGSDTISKNGTLSSVTSARALRPPKAAPPRPGTFTPTPSVSSQALSS 



RGKDDNTED - KEDA RPNREAYEEPPEQ LRELSR 

270 280 290 

350 360 370 380 390 

nputs PRLPRVDEPPPQAVSLTPGGVSSSALSRMGAVPVMVPAQSQAGSLV 



BREE- - EDD YRQEEQRSTGRES PDHLDQ - 
300 310 
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align calculates a global alignment of two sequences 
version 2.0uPlease cltei Myers and Miller, CABIOS (1989) 
* J 12 ? 8 1846 aa vs. 

> GenBank U79725 - Human A33 antigen precursor mR 2793 aa 
scoring matrix: paml20.mat, gap penalties! -12/-4 
40.0% identity; Global alignment score: 908 

10 20 30 

i nput S GTCGACCC ACGC-GTC CG - -GTGCAC- - ATT - - - - C - - OGGTTGCCGCC 

5 .mm: : 

- -CTACCCCTTTGTCAGCAGTCTAGGACTTTGTACACXrrGT^ 

10 20 ' 30 40 50 60 

40 50 60 70 
inputs G CT CACC - CACAACACCTGTAG AC AC- CGTGTGT 

GGTTTAAGGGCAACTTGAGGGAAGTAGGGAAGACTCCTCTTGGGACCTTTGGAGTAGGTGACACATGAGC 
7 0 80 90 100 110 120 130 

00 90 100 110 

inputs CCAAC TCTCC CTG AGTA - CTC CGGGCCA AGG - AGGGCCATGAT 



CCAGCCCCAGCTCACCTGCCAATCCAGCTGAGGAGCTCACCTGCCAATCCAGCTGAGGCTGGGCAGAGGT 
*«0 ISO 160 170 180 190 200 

120 130 140 150 160 

inputs TCTTCAG GCTGGAACCCCCGA- - -GACCAG-C- - -TTGCTGCGGGTT-TTGTTCCTG 

GGGTGAGAAGAGGGAAAATTGCAGGGACCTCCAGTTGGGCCAGGCCAGAAGCTGCTGTAGCTTTAACCAG 
210 220 2 30 24 0 250 260 270 

170 180 190 200 210 

inputs G- G ACTG AGTACCCTTGCTGCCTTCTCCCG AGCTCAG ATGGAGTT GCA CGTGCCC - - 

...mm. : : : : : : mm., m : 

ACAGCTCAG A - - CCTGTCTGG AGGCTGCCAGTG ACAGGTTAGGTTTAGGGC AG AG AAG AAGC AAGACCAT 
280 290 300 310 320 330 340 

220 230 240 250 

ln P uts CC GGGC-CTCAA- - CAAATTGG AAG - CGGTAG AGGGAGAAG AAGTG 

M mm ; ; : . . : : . . : : : : : : . . - : : : 

GGTGGGG AAG ATCTGGCCTGTGTTGTGGACACTCTGTGCAGTCAGGGTOA 
350 360 370 380 390 400 410 

260 270 280 290 300 

inputs GTGCTCCCCGCCTG- -GTACA-CGA TGGCACGGGAGGAGT CGTGGTCC 

s • • . : m m ; 

G AA - ACTCCGCAGGACGTTCITCGGGCTTCGCAGGG AAAG AGTGTCACCCTC 

420 430 440 450 460 470 480 

310 320 330 340 3S0 

inputs - -CACC-CC OGGGAGGTGCCCATCCT GATCTGGTTCT TGGAACAAGAAGGGAAGGAA 

X 1 * 1 8 : - : :: :. ::: . s : :: : : 

TCCACCTCCAGTCGAGAGGGACrTATTCAATXSGGATAAGCTC 

490 500 510 520 530 540 550 

360 370 380 390 400 

inputs CCAAACCAGGTQTTGTCTTA CATTAATGGAGTCATGACAAATAAACCTG - - - 

t . . . m . I . . m t . . i t.u:...tt.::. i.t:. .«:.::.. 

TCTGGCCGTTTTCAAAOUUUUVCT 

560 570 580 S90 600 610 620 

410 420 430* 440 450 

inputs GAACAGCCCTOGTCCAC--TCT ATCT CTTCACGGAATOTGTC-CCTGOG- 

1 1 • 1 1 1 ** • n. ttt. t.i .t si • • t x itt.il 

TOCTOAOCAOTCOOATXKXTCCATCAC^ 

«0 640. $50 660 670 660 . 690 

460 * 470 480 " 490 500 _ 

inputs -C CTGGGGaCACTCCAGGAGGQAaACTCTGGGAC TTACCGCTGTTCTGTCAATGTGC- - - 

« its. s i: .u.ttntt Mt.t t .mm j m: . mm 
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TCTGTCTCGCTGATGTCAGACCTCKJAGGGC^ 

700 710 720 730 740 750 760 

510 520 530 540 550 

inputs AG AATG ATG AAGGCAA - -AAGTATAGGCCACA GCATCAAAAQCATA - -GAGCT- -CAA- - 

s. • ttti. s»: : ::.::t : : : : :: us 

CCTCCAAACCAGAATGCGGCATCGAGGGAGAGACCATAATTGGGAACAACATCCAGCrG 

770 780 790 600 810 820 830 

560 570 580 590 600 610 620 

inputs AGTGCTGGTTCCTCCAGCTCCTCCATCCTGTAGTTTACAGGGTGTAC - -CCTATGTCGGGA CCAAT 

: . . : - : : : : : : .> : : . : . : ... : : : . . : : : : : 

AAAGG AGGGCTC ACCAACCCCTCAGT ACAGCTGGAAG - AGGTACAACATCCTG AATCAGG AGC AGCCCCT 
640 pSO 860 870 880 890 900 

630 640 650 660 
inputs GT G ACC CTG AACTGCAAGTCCCCAAGG AGTAAA ---CC TACTGC - TC 



GGCCCAGCCAGCCTCAGGTCAGCCTGTCTCCCTGAAGAATATCTCCACAGACACATCGGGTTACTACATC 
910 920 930 940 950 960 970 

670 660 690 700 710 
i npu t s AGTACCA GTGGG AG AG - - GCTG GCCCCATC - CT CC- - CAGGTCT TCTTTGG 

TGTACCTCCAGCAATGAGGAGGGGACGCAGTTCTGCAACATCACGGTGGCCGTCAGATCTCCCTCCATGA 
980 990 1000 1010 1020 1030 1040 

720 730 740 750 760 

inputs AC - C AG CCTT AG ATG CTGTTCGTGGATCTTTAAAGC TCACTAACCTT TC - -CAT 

ACGTGGCCCTGTATGTGGGCATCGCGGTGGGCGTGGTTGCAGCCCTCATTATCATTGGCATCATCATCTA 
1050 1060 1070 1080 1090 1100 1110 

770 780 790 800 

inputs TGCC ATG TCTGGAGTCTATGT - - CTGCAAGGCTCAAAACAG AGTGG 



CTGCTGCTGCTGCCGAGGGAAGGACGACAACACTGAAGACAAGGAGGATGCAAGGC-CGAACCGG - GAAG 
1120 1130 1140 1150 1160 1170 1180 

810 820 830 840 
inputs GCTTTG -CCA- - AGTGCAAC GTGACCTT GGACGTGATG - -ACAGG- - 

• • • • ... ...... • • • i s s s s s 

• ■ . • ........ ....... •••..«••• 

CCTATGAGGAGCCACCAGAGCAGCTAAGAGAACTTTCCAGAGAGAGGGAGGAGGAGGATGACTACAGGCA 
1190 1200 1210 1220 1230 1240 12S0 

850 860 870 680 

inputs GTCCAAGGCTGCAGTGGTCG-- CTGG - - AGCAGTTGTGGG 

I.::..:.:::::::: : . : : 

AGAAGAGCAGAGGAGCACTGGGCGTGAATCCCCGGACCACCTCGACCAGTGACAGGCCAGCAGCAGA 
1260 1270 1280 1290 1300 1310 1320 

690 900 910 920 

Inputs CA- C I TTTCTTGG O TT O OTQ CTGATAGCTGGGCT GGTCCTGTT-- 

2.: . : . . : t : t : . : : : t Mit t: .jusstt 

CGGCGGAGGAAGGGTTAGGGGTTCATTCnXXXXKriTCCrGGCXrrcC 

1330 1340 1350 1360 1370 1380 1390 

930 940 

nputs GTACCAG - -CGCC — GGAGCAAGAC 

. : 1 1 : s : : :i:«st.:*t 

CCTOTCOCTCCATCCCAGACATTGATGGGGACATTTCTTCCC^ 

1400 1410 1420 1430 2440 1450 1460 

950 960 970 980 

nputs CTTGGAA GAGCTGG-CCAA-TGA TATCAAG - G AAGATGCC ATT 

* t s s • i t.nti. n.« ttt t.tt *t tnt**«u * t 

CCTGGTAAGGGGGTCCCTGTGCIXJATCCTGCTGACCTCACTG 

1470 1480 1490 1500 1510 1520 . . 1530 

990 1000 1010 1020 
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inputs OCTCCC CGOACCTTOCCTT GGACCAA AOGCTC AGACACAA 

IA;AJUv a ^J!i , •iiL ss, 5 .tn ii. n in i 

ACACCTUOTCKXKWCCTCKKXXI^ 

1540 1550 1560 1570 1580 1590 1600 

1030 1040 1050 1060 1070 1080 

inputs TCTCCAAGAATXK5-GACACTTT-CTTCX^ G CG- -GCCACCCA 

n ~*LlL 2:5 : • : : • s : : : : 

GCTCCTGGAAGCACAGCGCTGAGCATGGGGCGCTCCCACTCAGAACTCT 

1610 1620 1630 1640 16S0 1660 1670 

.- 

1090 1100 1110 
inputs AGG CTGCTC- -CT CCAAGACCTGG CAC ATTTACT 

' : : . . :::::: : ; ; : : 

TGGGGGGTGGGGGCTGTCCTGCTCACCTGTGTGCCCAGCACCTGGAGGGGCACCAGGTGGAGGGTTTGCA 
1660 1690 1700 1710 1720 1730 1740 

„ 1120 1130 1140 1150 

inputs C-CCACAC- -C- - -C - AGTGT -CTCTAGCCAGGCCCTGTCCT- - -CAC 



CTCCACACATCTTTCTTGAATGAATGAAAGAAT^^ 

1750 1760 1770 1780 1790 1800 1810 

1160 1170 1180 1190 1200 1210 

inputs CAAGACT- - - GCCCAGGGTAGATG AACC - CCCACCTCAGGCAGT - - GTCCCTG ACCC - - CAGGTGGGGTT 

CAGCTCCCACTCCCTTTCCAACCTCACTTCCCGTAGCTGCCAGTATG^ 

1820 1830 1840 1850 1860 1870 1880 

1220 1230 1240 1250 

.nputs TCTTC TTCTGCTCTGAGCC -GCATGGG TGCTGTGCCTGT - GATG 

* : lis::.:.:.:: : . 

ACCTCCCACTCCTGCTGCACAGGCCCTGGGGAGCTTTTGCCCACACACTTTCCATCTCTGCCTC 

1890 1900 1910 1920 1930 1940 1950 

1260 1270 1280 1290 1300 1310 

.nputs - -GTGCCTG- - -CACAGAGTCAGGCT-GGGTCTCTTGTGTGA- - -TAGCCCAGGCACTCATTAGCTACAT 
*rJLLlLLll : * : " : * :: -••;!•:.-•! • : :: : :::: 

TCGTACCTGTCCCTCC AGGCCCATCTCAAATCACAAGGATTTCTCTAACCCTATC - CT AATTG TCCAC AT 
I960 1970 1980 1990 2000 2010 2020 

1320 1330 1340 1350 1360 1370 

nputs -C-TGGTATCTGACCT- - TTCTGTAAAGGTC - TCCTT - - GTGGCACAG AGG ACTCAATCTT - -GGG AGG A 
: i : iz 5 . . : ... . : . m . # : # . . # . . . , ... 

ACGTGGAAACAATCCTGTTACTCTGTCCCACGTC^ 

2030 2040 2050 2060 2070 2080 2090 

1380 1390 1400 1410 1420 

nputs TGCCCACA- - -TTCTAGACCTCCAG -TCCTTTG - - CT- - -CCTA- -CCTC CTT- - -CTAT- ? -TGT 

^vliLJLljL 5 ^™!* m« i mi tin* i tin in ::. 

TGCTCTCACTOTATTAGAGCGCCAGCTCCTTG^ 

2100 2110 2120 2130 2140 2150 2160 

1430 1440 1450 
nputs TG GAATACTGG-GCC- -TC- -AGTAAG-ACTAAA ATCTG- 



*J» * it* it in.. i tn.tt 

AGCCCTAGTAGCTGGCGCCCATCCTAGTGGGCAOT 

2170 2180 2190 2200 2210 2220 2230 

nnilhe 1470 1480 
pUtB GGTCA - AAGOACAAAAGGAGOAAAT- GGACC 



TCCTTCCCTTCTCTGOXClXXrrTO 

2240 2250 2260 2270 2280 2290 2300 

1490 1500 K ^ 1510 1520 1530 
Q P Uts TQAGGTAGG- - -GGGTTGGGAGTGAGGAGGCT-TCACTT CCTCCCXOCT7 TCT- 

_ 1 * ' 1 » * • • t..ttt. . ..if ...1. • 1 1 1 • • 1 s 1 titl: lis 

CATTCAGTGAGTTAAACAOGAATTOATTTAAAGTGUUICACACA 

2310 2320 2330 N 2340 2350 2360 2370 
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1540 1550 1560 1S70 1580 
inputs - - -CCCTGAAGCCAGATGAATGCT- GC- -GGAAGATCGGCT- - ACCCTCCAAGGGCT 

AGTTCTTGTGTCCTGGTAATTCCTCTCCAGGCCAGAATAATTGOCATGTCTCCTCAACCC 

2380 2390 2400 2410 2420 2430 2440 

1590 1600 1610 1620 1630 1640 

inputs C - TGG AGGAGACTGCCAGTCAGTGATGC - -CCCTGGCTCTG TGATCTGTAC AACACCC - TTATCTAA 

• • • • * a a a a • * , • • * • • ••••• 

• • • «. • a a a a a a • • • a • a . a a a • ■ a a a 

CCTGGTTGTTCCTGCATCCCGATACCTCAGCCCTGGCCCTGCCCAGCCCATTTGGGCTCTGGTTTTCTGG 
2450 2460 2470 2480 2490 2500 2510 

1650 1660 1670 1680 1690 
inputs TG - - - CTGTCCTT - TGCCGTTCGCTCCATCTCC - - CTGT - ATTAATATAAC 



TGGGGCTGTCCTGCTGCC-CTCCCACAGCCTCCTTCTGTTTGTCGAGCATTTCTTCTACTCTTGAGAGCT 
2520 2530 2540 2550 2560 2570 2580 

1700 1710 1720 
inputs CTGTC CTGCT GGCT -TGGCTGG GTTT- -TGTTG 

» * a a a a • ■ • • a • • «••• a • a • a 

CAGGCAGCGTTAGGGCTGCTTAGGTCTCATGGACCAGTGGCTGGTCTCACCCAACTGCAGTTTACTATTG 
2590 2600 2610 2620 2630 2640 2650 

1730 1740 1750 1760 1770 

inputs TAG C AGGGGG AT AGGAAAG AC ATTT - - TAAAATCTG ACTTGAAATTGATGTTTTTGTT 

• • ••••• • a • a • a a a a 2 a • a * «•••«•••• J a a # a a a a a aa m * » m * * » 

CTATCTTTTCTGGATGATCAGAAaAaAATaAATTCCATAAATCTATTGTCTACTTGCG.^ 

2660 2670 2680 2690 2700 2710 2720 

1780 1790 1800 1810 1820 1830 1840 

i npu t S TTTATTTTGCAAATTTCAATAAAGA TACATCG CATTTGCATGGAAAAAAAAAAAAAAAGGGCG 



TATATTTTTATATATATTGTTAAATCCTTTGCTTCATTCCAAATC 

2730 2740 2750 2760 2770 2780 2790 



inputs GCCGC 
T--GG 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> HT258 n.a. 1869 aa vs. 

> pecani n.a. 2557 aa 
scoring matrix: paml20.mat, gap penalties: -12 /-4 

40.5% identity; Global alignment score: 1546 

10 20 . 

inputs G TC GACC CAC GCGTNCNTC-CAGCGTN- 



GAATTCCGGGAGAAGTGACCAGAGCAATTTCTGCTTTTCACAGGGCGGGTTTCTCAACGGTGACTTGTGG 
10 20 30 40 50 60 70 

30 40 50 60 70 80 
inputs -CGGAGCCGCC — CTGGG — TGTCAGCGGCTCGGCTCCCGCGCACGCT CCGGC CGTCGC 



• * • • 



GCAGTGCCTTCTGCTGAGCGAGTCAT-GGCCCGAAGGCAGAACTAACTGTGCCTGCAGTCTTCACTCTCA 
80 90 100 110 120 130 

90 100 110 120 130 

inputs GCAGCCTCGGCA — CCTGCAGGTCCG TGCGTCCCG CGGCTGGCGCCCCTGACTCCGTC 

• •»*•* • • •••• a • • • • 

• ••••• • • ••• a . • • • • ■ • • »••••• • • » • * ■ • v • 

GGATGCAGCCGAGGTGGGCCCAAGGGGCCACGATGTGGCTTGGAGTCCTGCTGACCCTTCTG-CTCTGTT 
140 150 160 170 180 190 200 

140 150 160 170 180 
inputs CCGGCCAGGGAGGGCCATGATTTCCCT — CCCGG — GGCC CCTGGTG A- CC AAC T 

• .«•*. • . i • . ......... «• * * • . «•• • •••••• *•*.« • 

CAAGCCTTG-AGGGTCAAGAAAACTCTTTCACAATCAACAGTGTTGACATGAAGAGCCTGCCGGACTGGA 
210 220 230 240 250 260 270 

190 200 210 220 230 

inputs TGNTGCGGTTT TTGTTCCTGGGGCTG-AGTGC — C-C TC-GCGCCCCC-CTCGCG GGCC 

..:..::::. ::: ::::: : : :: ::.:: :: :::.:: •::: 

CGGTGCAAAATGGGAAGAACCTGACCCTGCAGTGCTTCGCGGATGTCAGCACCACCTCTCACGTCAAGCC 
280 290 300 310 320 330 340 

240 250 260 270 280 

inputs -CAGCTGCAACTGC ACTTGC C CGCCAACCGGTTGCAGGCGGTG 

::::•::•::: : : : : : : :::.: :••:••:•:•: 

TCAGCACCAGATGCTGTTCTATAAGGATGATCTGCTGTTTTACAAC^TCT 
350 360 370 380 390 400 410 

290 300 310 320 330 340 

inputs GAGGAGGGGGA AAGT — GGTGCTTCAGCA-TGGTACACCT TGCACAGGGAGGTGTCTTCATC 

s .. :ssi ::. ::. ..x. .:: tit i it tt.t it itt ..::•: 

AGTTATTTTATTCXrrGAAGTCCX^ATCTATGAC^ 

420 430 440 450 460 470 480 

350 360 370 380 390 

inputs CCAG CCA-TGGGAGGTGCC — CTTT — GTGATGTGGTTCTTCAAACAGAAAGAAAAGGAGGATC 

:: ::: •••::.:: x: : :::••» :• : ::••: :..:..: . •::: 
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AAAGAGAAAACCACTGCAGAGTACCAGCTGTTGGT6GAAGGA6T6CCCAGTCCCAGGGT6ACACTGGACA 
490 SOO 510 520 530 540 SSO 

400 410 420 430 440 4S0 

inputs AGGTGTTGTCCTACATCAATGGGGTCA CAACAAG— CAAACCTGGAGTAT CCTTGG-T 

s:... .: :s..: . ..tttt.ti. :. s: .. . 

AGAAAGAGGCCATCCAAGGTCGGATCGTGAGGGTCAACTGTTCTGTCCCAGAGGAAAAGGCCCCAATACA 
S6 ° 570 580 590 .600 610 620 

460 470 480 490 SOO 
inputs CTACTC CATGCCCTCCCGGAACC — TGTCCCTGC— GGGTGGAGGG TCTC 



: : : t t • t : : : : : . : : : : : 

CTTCACAATTGAAAAACTTGAACTAAATGAAAAAATGGTCAAGCTGAAAAGAGAGAAGAATTCTCGAGAC 
630 6 40 650 660 670 680 690 

510 520 530 S40 550 

inputs C AGG AG AAAG ACTCTGG CCCCTAC -AGGTGCTCCGTGAATGTGC — AAGACAAACAA 

: • . s.:.s .::s . • 

CAGAArTTTGTGATACTGGAATTCCCCGTTGAGGAACAGGACCGCGTTTTATCCTTCCGATCTCAAGCTA 
700 710 720 730 740 750 760 

560 570 580 590 
inputs GG— CAAATCTAGGGGCCA CAG -CATCAAAA CCTTA GAACTCAATG 



* • ; i - * 

GGATCATTTCTGGGATCCATATGCAGACCTCAGAATCTACCAAGAGTGAACTGGTCACCGTGACGGAATC 
770 780 790 800 810 820 830 

600 610 620 630 

inputs -TACT GGTTCCTC— — CAGCTCCTCC ATCCTG C-CGTCTCCA — GGGTG 

: • : ! : s : s : : . : : : : : j : . . j 

CTTCTCTACACCCAAGTTCCACATCAGCCCCACCGGAATGATCATGGAAGGAGCTCAGCTCCACATTAAG 
840 850 860 870 880 890 900 

640 550 660 670 680 690 

inputs TGCCCCATG-TGCGGGCAAACGTGACCCTG-AGCTGCCAG TC TC -CAAGGAGTAAG 

ss: s::: s s • s J ::::: : SJ :• :::::»...: 

TGCACCATTCAAGTGACTCACCTGGCCCAGGAGTTTCCAGAAATCATAATTCAGAAGGACAAGGCGATTG 
910 920 930 940 950 960 970 

700 710 720 

inputs CCC GCTGT C CAATACCAGTG-GGATC GGCAGCTT 

**' ttttt : : : . : ::: : ixxt.tt. 

TGGCCCACAACAGACATGGCAACAAGGCTGTGTACTCAGTCATGGCCATGGTGGAGCACAGTGGCAACTA 
980 990 1000 1010 1020 1030 1040 

730 740 750 760 770 

Inputs C-CATCCT TCCAGAC TTTCTTTG — CACCAGCATTAGATGTCATCCGTG — GGTCTTTA 

* *•* s * ***** * «•** ** xxxxxx x. xtxt.t .. x..tx.t. 

CAOGTGCAAAGTGGAGTCCAGCOGCATATCCAAGGTGAGCAGCATC-GTGGTCAACATAACAGAACTATT 
1050 1060 1070 1080 1090 1100 ' 1110 

780 790 800 810 820 830 
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input 8 AGCCTCACC-AACCTTTCGTCTTCCAT GGCTGGA GTCTATGTCTG — CAA6GCC — CACA 

• ss« >ss .s is .nn: j,i . tin: ss •:•:•$*: : .s is s.s 
TTCCAAGCCCGAACTGGAATCTTCCTTCACACATCTGGACCAAGGTGAAAGACTGAACCTGTCC^ 

1120 • 1130 1140 1150 1160 1170 1180 

840 850 860 870 

inputs ATG — AGGTG GGC — ACTGOCCA ATGTAA TGTGACGCTGG AAGTGA 

^ .:: :.: :: ::s. . 

ATCCCAGGAGCACCTCCAGCCAACTTCACCATCCAGAAGGAAGATACGATTGTGTCACAGACTCAAGATT 
1190 1200 1210 1220 1230 1240 1250 

880 890 900 910 920 930 

inputs GCAC AGGGCCT-GGAG-CTG-CAGTGGTTGCTGAAGCTGT — TGTGGGTACC- - CTGGTTGG ACTG 

::: s • : : - sss.s.'. 

TCACCAAGATAGCCTCAAAGTCGGACAGTGGGACGTATATCTGCACTGCAGGTATTGACAAAGTGGTCAA 
1260 1270 1280 1290 1300 1310 1320 

940 950 960 970 980 

input s GGGTTGCTG-GCTGGGCTGGTCCTCTTGTA C-CACC — GCCGGGG CAAG GCCCTG 

:: • :.;* ::• 

GAAAAGCAACACAGTCCAGATAGTCGTATGTGAAATGCTCTCCCAGCCCAGGATTTCTTATGATGCCCAG 
1330 1340 1350 1360 1370 1380 1390 

990 1000 1010 1020 1030 1040 
i npu t S G AGG AG CCAG CCAATG ATATCAAG — G AGG ATG - CCATTGCTCCCCGG ACC CT — GCC- C 

• • 5 : : • m :: ::: : 
TTTGAG—GTCATAAAAGGACAGACCATCGAAGTCCGTTGCGAATCGATCAGTGGAACTTTGCCTATTTC 

1400 1410 1420 1430 1440 1450 1460 

1050 1060 1070 1080 1090 
inputs TGGCC — CAA — GAGCTCAGACACAATCTCCAAGAATGGGACCCTTTCCTCTGTCA — CCTCCG C 

» • • • ••• • S * > S S • • 1 1ml 1 .!•••! • • • • • • • • 

* • " " • • •••••••• * • • * • ••••»■ • « • • mm • 

TTACCAACTTTTAAAAACAAGTAAAGTTTTGGAGAATAGTACCAAGAACTCAAATGATCCTGCGGTATTC 
1470 1480 1490 1500 1510 1520 1530 

1100 1110 1120 1130 

inpu t s A CGAGCCCTCCG — GCCACC CCA- TGGC CC — TCCCAGGCCT 

s s.s sss.s : s s. * sss s ss ss ::::: sss. 

AAAGACAACCCCACTGAAGACGTCGAATACCAGTGTGTTGCAGATAATTGCCATT 

1540 1550 1560 1570 1580 1590 1600 

1140 1150 1160 1170 

inputs -GGTGCATTG « ACCCCC ACGCCCAG TCTATCCAGCCAGG 

* * s * • . s i tut : ; tut j : x : i : 

TAAGTGAGGTTCTGAGGGTGAAGGTGATAGCCCCXXJTGGATGAGGTCCAGATTTCT 

1610 1620 1630 1640 1650 1660 1670 

1180 

Lnputs -c C CTGC CC — TCACCAAG 

* t *s*s ss sssss.s 

GGTGGTGGAGTCTGGAGAGGACATTGTGCTGCAATGTGCTGTGAATC 

1680 1690 1700 1710 1720 1730 1740 
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1190 1200 1210 1220 1230 1240 

inputs ACATGC — CCACGACAGATGGGGCCCACCCT-CAACCAA-TATCCCC CATCCCTGG TGGGGT 

* - s s • s: .un ::::: : # t tti t : : 

AAGTTTTACAGAGAAAAAGAGCGCAAACCCTTCTATCAAATGACCTCAAATGCCACCCAGGCATTTTGGA 
1750 1760 1770 1780 1790 1800 1810 

1250 1260 1270 1280 1290 1300 
inputs TTTTTCCTTTGGCTT TGAGCCGCATGGGTG CTG NGCCTGTGATGGNGCCTGCC 

CCAAGCAGAAGGCTAGCAAGGAACAGGAGGGAGAGTATTACTGCACAGCCTTCAACAGAGCCAACCACGC 
1820 1830 1840 1850 ' 1860 1870 1880 

1310 1320 1330 1340 1350 

inputs CAGAGTC AAGCTGGCTCTCTG-GT-ATGATGACCCCACCACTCATTGG-CTAAAG 



• • • 



CTCCAGTGTCCCCAGAAGCAAAATACTGACAGTCAGAGTCATTCTTGCCCCATGGAAGAAAGGACTTATT 
1890 1900 1910 1920 1930 1940 1950 

1360 1370 1380 1390 

inputs G ATTTGGG -GTOTC TCCTTCCTATAAGGGTCA CCTCTAGCA CAGAGG — — 

: • 2 : s • s : : : 

GCAGTGGTTATCATCGGAGTGATCATTGCTCTCTTGATCATTGCGGCCAAATGTTATTTTCTGAGGAAAG 
1960 1970 1980 1990 2000 2010 2020 

1400 1410 1420 1430 1440 1450 
inputs CCTGAGTCATG-GGAA---AGAGTCACACTCCTGACCCTTAGTAC TCTG — CCCCCACCTC 



• • • 



CCAAGGCCAAGCAGATGCCAGTGGAAATGTCCAGGCCAGCAGTACCACTTCTGAACTCCAACAACGAGAA 
2030 2040 2050 2060 2070 2080 2090 

1460 1470 1480 1490 1500 1510 

i npu t S TCTTTAC TGTGGG AAAACCATC--TCAGTAAGACCTAAGTGTCCAGG AG ACAGAAGGAGAA-G A 

• • • 3 •••• •••••• S * S 2 £ £ • • • • • • « • • » «•••• • 

•♦• • • • • ••*••« ••••••••••• ••• 

AATGTCAGATCCCAATATGGAAGCTAACAGTCATTACGGTCACAATGAC—GATGTCAGAAACCATGCAA 
2100 2110 2120 2130 2140 2150 2160 

1520 1530 1540 1550 1560 1570 

inputs GGAAGtGGATCTGGAATTGGGAGGAGCCTCCACCCACC-CCTGAC — TCCTCC TTATGAAGCCAGC 

• t : .:.:tss:si • i ♦ : : j . : t : : : 

TGAAACCAATAAATGATAATAAAGAGCCTCTGAACTCAGA 

2170 2180 2190 2200 2210 2220 2230 

1580 1590 1600 1610 1620 1630 1640 

inputs TGCTGAAATTAGCTACTC^CCAAGAGTGAGGGGCAGAGACTTCCAGTCA GOC — 

•ttssi.* s # i s*:x«,:x«s;.rx ttt . t . :*,t u 
AGCTCAGTCTCACAAAGATCTAGGAAAGAAGGACAC^^ 

2240 2250 2260 2270 2280 2290 2300 

1650 1660 1670 1680 1690 1700 

inputs CCCTTGATCTGTACCCCAC CCCTA-—TCTAACACCACCCTTG — GCTCCCACTCCAGCTC~CCT 

t:*r r ♦ * t ttt :t:s; ts. 

CCCTGATGCOGTGGAAAGCAGATACTCTAGAACJGGAAGGCTCCCTTGATGGAACTTA 
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2310 2320 2330 2340 2350 2360 2370 

1710 1720 1730 1740 1750 1760 

inputs GTATTGATATAACCTGTCAGG-CTGGCTTGGTTAG-GTTTTACTGGGG CAGAGGATAGGGA 

*• JS • :. : : . 

GA — TGCACATCCCTGGAAGGACATCCATGTTCCGAGAAGAACAGATAATCCCTGTATTTCAAGACCTCT 
2380 2390 2400 2410 2420 2430 

1770 1780 1790 1800 1810 1820 

inputs -ATCTCTTATTAAAA CTAACATGAAATATGTGTTGTTTTCATTT — GCAAATTTAAATAAAGATACA 

• ::. : :: . . # . 

GTGCACTTATTTATGAACCTGCCCTGCTCCCACAGAACACAGCAATTCCTCAGGCTAAGCTGCCGGTTCT 
2440 2450 2460 2470 2480 2490 2500 

1830 1840 1850 1860 

input s TAAT GTTTGTATGAGATAAG AAAAAAAAAAAAAAAGGGCGGCCGC- 

:5S » • * : 

TAAATCCATCCTGCTAAGTTAATGTTGGGTAGAAAGAGATACAGAGGGG 
2510 2520 2530 2540 2550 
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TANGO 281 

Input file AthPb81dl0.seq; Output File AthPb81dl0.pat 
Sequence length 1812 



M R L 3 

GTCGACCCACGCGTCCGGCGGAGGTTGTGGCTGCACCGTGGTCCTCGGCTTGGTCCTC ATG CGT CTG 73 
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V P * 246 
GTT CCC TGA 802 

GCCTGCC^CCAGCCTCTT^ 881 

TGTGGTGCGTGTCCCTTGTCT 960 

GGGACC^TCTGTTTTCTTC 1039 

TATTTCCCGCTTCACCCCAAGC/TGACCAGCC^TA 1118 

GCTTTX3GGGAGTAGCCAGCTAGCTGCTGCTAT^ 1197 

GGGCeTCA&l^^ 1276 

GTCTGGACX5GTCCC^CT^ 1355 

t 

GGCCA(5G<kSCCTCTGTCTACTACACACTCTGGTTTCTCCCTACAGTGTCT^ 1434 

TTTTTGTATCCAGATGTGICATAATIXSGTGAC^ 1513 

FI6.28A 
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TCCGTTTCCCTTGACACCAGCTTCATAGAATACCTGACTCCTGTAC 1592 

AGGGCCAGGGGTTATCTGGACCAAGGGTGGGGGTGGAGAGCCTGGATGGTAGCTCTGGACCAGATGTC 1671 

ATTCCCTGTTGGTTCCTGTTTCACTGGCTC 1750 
TCGTTT ATAAT AAATG AA T ATTTG G AAAAAAAAAAAAAAAAAAAAAAAAAA 1812 
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I ' I ' I ' I ' I ' I ' I ' I ' I ' I ' I ' I ' I 
1 41 81 121 1G1 201 241 

>hT281 

MRLFVRPSVRPAMAAPAPS PWTLSLLLLLLLPS PG AHG ELCRPFG EDNS I PESC PDFCCG 

SCSSQYCCSDVLKKIQVWEEMCPEPESSRFSAHPETPEQLGSALKYQSSLDSDNMPGFGA 
™A^GLTVFVVFIATIIVCFTCSCCCLYKMrc^ 

*~^<^ TOp MPPQPGMPAAPYPT«YPPPYIAQPTGPPAYHETIAGASQPPyNPAYMDP 
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Alignments of top-scoring domains: 

PSBH: domain 1 of 1, from 97 to 146: score 5.5, E = 8.5 

# ->ktalgelLkPlnseyGKvaPgV;GttplmgvfroalfavFLliileiVn 
*lg+ Lk s *Pg+G t+ +g +++f*vF+ i+ ♦ 

nT281 97 PEQLGSALKYQSSLDSDNMPGFGATVAIG — LTVFWFIATIIVCFT 141 



ssvll<-* 

■ s 

hT281 142 CSCCC . 146 
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Input file T281Atmea49d3; Output File T281Atmea49d3 .pat 
Sequence length 1858 

3TCGACCCACGCGTCCGCGCGGAGG 7 9 

MAA PAPSLWTLLLLLLL 17 

ttotcccgcc atg gct gcg ccg gcg ccc tct ctg tgg acc cta ttg ctg ctg ctg ttg ctg 140 

lppppgahge'lcrpfgedns 37 

TTG CCG CCG CCT CCG GGT GCC CAT GGT GAG CTG TGC AGG CCC TTT GGT GAA GAC AAT TCG 200 

IPV FCPDFCCGSCSNQYCCS 57 

*TC CCA GTG TTC TGT CCT GAT TTC TGT TGT GGT TCC TGT TCC AAC CAA TAC TGC TGC TCG 260 

DVLRKIQWNEEMCPEPESSR 77 

AC GTG CTG AGG AAA ATC CAG TGG AAT GAG GAA ATG TGT CCT GAG CCA GAG TCC AGC AGA 320 

FSTPAE ETPEHL G SALKFRS 97 

TT TCC ACC CCC GCG GAG GAG ACA CCC GAA CAT CTG GGT TCA GCG CTG AAA TTT CGA TCC 380 

SFDSDPM SGFGATVAIGVT I 117 

3T TTT GAC AGT GAC CCT ATG TCA GGG TTC GGA GCG ACC GTC GCC ATT GGC GTG ACC ATC 440 

rV VF IATlIICFTCSCCCLY137 

IT GTG GTG TTT ATT GCC ACT ATC ATC ATC TGC TTC ACC TGC TCC TGC TGC TGT CTG TAT 500 

<MCCPQRPVVTNTTTTTVVH 157 

VG ATG TGC TGC CCC CAA CGC CCT GTC GTG ACC AAC ACC ACA ACT ACT ACC GTG GTT CAT 560 

tPYP. QPQ.pQpvAPSYPGPTY 177 

:C CCT TAC CCT CAG CCT CAA CCT CAA CCT GTG GCC CCC AGC TAT CCT GGA CCA ACA TAC 620 

>GYHPMPPPAR NAS S TLPNA 197 

.G GGC TAC CAT CCC ATG CCC CCC CCA GCC AGG AAT GCC AGC AGC ACC CTA CCC AAC GCA 680 

PT TLPGPAHRAATLP* 214 

'A CCC ACC ACC CTA CCT GGC CCA GCC CAC AGG GCC GCC ACC CTA CCA TGA 731 

•CCITGGCTGGAGCCAGCCAGCCTCC^TAC^ 810 

GCCTCTTTGGCTGCCATTTATGTCGTGTGTGAGTGAGTC 889 

TttmrTAGACATGTGGCTTCCTCTGCTGTTC 968 

TCTTCCTCACTTCAAATTGTACTTTCIX3AAATTT 104 7 

CCCAAGGTGACCAGCCATGGCCTOTCATAC 1126 

^GCTAGCTGCTGCTAGGC^^ 12 05 

TGACAAGGGGACTCAGTGGCAGGGGGTCACACCAG 1284 



FIG. 31 A 
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ACTGTCCTTCCCXXXX3CTOTATAGAGGGCCACA 1363 

TGGCTTGGCTCCTGCTCCCAGGCC^GGGAC 144 2 

rTGTTAGCCAAACATTTTGCCTGTTTTCTGTCTCCAGATGTGTGA 1521 

XGGACAGACAACCTGACCTCCGACTGTCAGTTTCCCTTGAC^ 1600 

rCAGTTTGTCCCAGTAGCAGGGACACCAAGGCCAATGGGTTATCTGGACCAAAGGTC 1679 

rTCCGGCCCAGATGTGAATACCTCCATATTCTC 1758 

STTTCTGAGCATTCAGACTCCGCACCCTCATTT^ 1837 

\AAAAAAAAAGGGCGGCCGC 1858 
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r-s W!tl hit* 
Cyi 'l fill t IUI 




1 1 I ., 1 1 1 ,. 1 1 1 1 1 1 1 1 1 1 1 • 

1 41 81 1Z1 161 201 



•mT281 

MAAPAP S LVTTLLIXLLLLP P P PG AHG ELCRPFG EDNS I PVTC PDFCCG SCSNQYCC S DVL 
RKIQWNEEMC PEPES SRFSTPAEETPEHLGSALKFRS SFOSDPMSGFG ATVAIGVTI FW 
FIATIIICFTCSCCCLYKMCCPQRPVVTNTTTTTVVHAPypQ 
HPMPPPARNASSTLPNAVPTTLPGPAHRAATLP 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> hT281 a. a. 245 aa vs. 

> roT281 a. a. 213 aa 
scoring matrix: paml20.mat, gap penalties: -12/-4 

66,5% identity; Global alignment score: 739 

10 20 30 40 50 60 70 

inputs" MRLFVRPSVRPAMAAPAPS PWTLSLLLLLLLPS PG AHGELCRPFGEDNS I PESC PDFCCGSCSSQ YCCSD 
: ::::::::::::::: : : 

M AAPAPSLWTLLLLLLLLPPPPGAHGELCRPFGEDNSIPVFCPDFCCGSCSNQYCCSD 

10 20 30 40 50 

80 90 100 110 120 130 

inputs VLKKIQWNEEMCPEPESSRFSAHPE-TPEQLGSALKYQSSLDSDNMPGFGATVAIGLTVFWFIATIIVC 

•: :::.:::::■•••••••• 

VLRKIQWNEEMCPEPESSRFSTPAEETPEHLGSALKFRSSFDSDPMSGFGATVAIGVTIFWFIATIIIC 
60 70 80 90 100 110 120 

140 150 160 170 180 190 200 

inputs FTCSCCCLYKMCCRPRPWStTTTTTTVVHTAYPQ^ 

FTCSrcCLYWKCCPQRP^ PARN 

130 140 150 160 170 180 

210 220 . 230 240 

inputs PPPYLAQPTGPPAYHETLAGASQPPYNPAYMDPPKAVP 
• ••2 : . • : • : :■» •♦: 

ASSTL--PNAVPT TLPGPAHRA ATLP 

190 200 210 
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SEQUENCE LISTING 



<110> Millennium Pharmaceuticals, Inc. 
<120> SECRETED PROTEINS AND USES THEREOF 



<130> 7853-210-228 

<140> 
<141> 

<150> 09/336,536 
<151> 1999-06-18 



<160> 198 



<170> Patentln Ver. 2.0 

<210> 1 
<211> 1338 
<212> DNA 

<213> Homo sapiens 
<400> 1 

gtcgacccac gcgtccggga ctggggtgac ggcagggcag ggggcgcctg gccggggaga 60 

agcgcggggg ctggagcacc accaactgga gggtccggag tagcgagcgc cccgaaggag 120 
gccatcgggg agccgggagg ggggactgcg agaggacccc ggcgtccggg ctcccggtgc 180 
cagcgctatg aggccactcc tcgtcctgct gctcctgggc ctggcggccg gctcgccccc 240 

actggacgac aacaagatcc ccagcctctg cccggggcac cccggccttc caggcacgcc 300 

gggccaccat ggcagccagg gcttgccggg ccgcgatggc cgcgacggcc gcgacggcgc 360 

gcccggggct ccgggagaga aaggcgaggg cgggaggcgg gactgccggg acctcgaggg 420 

gaccccgggc cgcgaggaga ggcgggaccc gcggggccca ccgggcctgc cggggagtgc 480 

tcggtgcctc cgcgatccgc cttcagcgcc aagcgctccg agagccgggt gcctccgccg 540 

tctgacgcac ccttgccctt cgaccgcgtg ctggtgaacg agcagggaca ttacgacgcc 600 

gtcaccggca agttcacctg ccaggtgcct ggggtctact acttcgccgt ccatgccacc 660 

gtctaccggg ccagcctgca gtttgatctg gtgaagaatg gcgaatccat tgcctctttc 720 

ttccagtttt tcggggggtg gcccaagcca gcctcgctct cggggggggc catggtgagg 780 

ctggagcctg aggaccaagt gtgggtgcag gtgggtgtgg gtgactacat tggcatctat 840 

.gccagcatca agacagacag caccttctcc ggatttctgg tgtactccga ctggcacagc 900 

tccccagtct ttgcttagtg cccactgcaa agtgagctca tgctctcact cctagaagga 960 

gggtgtgagg ctgacaacct ggtcatccag gagggctggc ccccctggaa tattgtgaat 1020 

gactagggag gtggggtaga gcactctccg tcctgctgct ggcaaggaat gggaacagtg 1080 

gctgtctgcg atcaggtctg gcagcatggg gcagtggctg gatttctgcc caagaccaga 1140 

ggagtgtgct gtgctggcaa gtgtaagtcc cccagttgct ctggtccagg agcccacggt 1200 

ggggtgctct cttcctggtc ctctgcttct ctggatcctc cccaccccct cctgctcctg 1260 

gggccggccc ttttctcaga gatcactcaa taaacctaag aaccctccaa aaaaaaaaaa 1320 
aaaaaaaagg gcggccgc 1338 

<210> 2 

<211> 728 

<212> DNA 

<213> Homo sapiens 

<400> 2 

atgaggccac tcctcgtcct gctgctcctg ggcctggcgg ccggctcgcc cccactggac 60 
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gacaacaaga tccccagcct ctgcccgggg caccccggcc ttccaggcac gccgggccac 120 

catggcagcc agggcttgcc gggccgcgat ggccgcgacg gccgcgacgg cgcgcccggg 180 

gctccgggag agaaaggcga gggcgggagg cgggactgcc gggacctcga ggggaccccg 240 

ggccgcgagg agaggcggga cccgcggggc ccaccgggcc tgccggggag tgctcggtgc 3 00 

ctccgcgatc cgccttcagc gccaagcgct ccgagagccg ggtgcctccg ccgtctgacg 360 

cacccttgcc cttcgaccgc gtgctggtga acgagcaggg acattacgac gccgtcaccg 420 

gcaagttcac ctgccaggtg cctggggtct actacttcgc cgtccatgcc accgtctacc 480 

gggccagcct gcagtttgat ctggtgaaga atggcgaatc cattgcctct ttcttccagt 540 

ttttcggggg gtggcccaag ccagcctcgc tctcgggggg ggccatggtg aggctggagc 600 

ctgaggacca agtgtgggtg caggtgggtg tgggtgacta cattggcatc tatgccagca 660 

tcaagacaga cagcaccttc tccggatttc tggtgtactc cgactggcac agctccccag 720 

tctttgct " " ~ 728 

<210> 3 

<211> 243 

<212> PRT 

<213> Homo sapiens 

<400> 3 

Met Arg Pro Leu Leu Val Leu Leu Leu Leu Gly Leu Ala Ala Gly Ser 
1 5 10 15 

Pro Pro Leu Asp Asp Asn Lys He Pro Ser Leu Cys Pro Gly His Pro 
20 25 30 

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gin Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu 
50 55 60 

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp Pro 
65 70 75 80 

Gly Pro Arg Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Ala Gly 
85 90 95 

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu 
100 105 HO 

Ser Arg Val Pro Pro Pro Ser Asp Ala Pro Leu Pro Phe Asp Arg Val 
115 120 125 

Leu Val Asn Glu Gin Gly His Tyr Asp Ala Val Thr Gly Lys Phe Thr 
130 135 140 

Cys Gin Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr 
145 150 155 160 

Arg Ala Ser Leu Gin Phe Asp Leu Val Lys Asn Gly Glu Ser He Ala 
165 170 175 

Ser Phe Phe Gin Phe Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser 
180 185 190 

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gin Val Trp Val Gin 
195 200 205 
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Val Gly Val Gly Asp Tyr lie Gly He Tyr Ala Ser He Lys Thr Asp 
210 215 220 

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro 
225 230 235 240 

Val Phe Ala 



<210> 4 
<211> 228 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Ser Pro Pro Leu Asp Asp Asn Lys He Pro Ser Leu Cys Pro Gly His 
1 5 io ^ 15 

Pro Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gin Gly Leu Pro 
20 25 30 

Gly Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly 
35 40 45 

Glu Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp 
50 55 60 

Pro Gly Pro Arg Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Ala 
65 70 75 80 

Gly Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser 
85 90 . * 95 

Glu Ser Arg Val Pro Pro Pro Ser Asp Ala Pro Leu Pro Phe Asp Arg 
100 105 HO 

Val Leu Val Asn Glu Gin Gly His Tyr Asp Ala Val Thr Gly Lys Phe 
115 120 ~ 125 

Thr Cys Gin Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val 
130 135 140 

Tyr Arg Ala Ser Leu Gin Phe Asp Leu Val Lys Asn Gly Glu Ser He 
145 150 155 160 

Ala Ser Phe Phe Gin Phe Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu 
165 170 175 

Ser Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gin Val Trp Val 
180 185 190 

Gin Val Gly Val Gly Asp Tyr He Gly He Tyr Ala Ser He Lys Thr 
195 200 205 

Asp Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser 
210 215 220 
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Pro Val Phe Ala 
225 



<210> 5 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 5 

Met Arg Pro Leu Leu Val Leu Leu Leu Leu Gly Leu Ala Ala Gly 
15 10 15 



<210> 6 . 

<211> 60 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Gly Thr Pro Gly His His Gly Ser Gin Gly Leu Pro Gly Arg Asp Gly 
1 5 10 15 

Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu Lys Gly Glu 
20 25 30 

Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp Pro Gly Pro Arg 
35 40 45 

Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Ala 
50 55 60 



<210> 7 
<211> 128 
<212> PRT 

<213> Homo sapiens 
<400> 7 

Ala Phe Ser Ala Lys Arg Ser Glu Ser Arg Val Pro Pro Pro Ser Asp 
1 5 10 15 

Ala Pro Leu Pro Phe Asp Arg Val Leu Val Asn Glu Gin Gly His Tyr 
20 25 30 

Asp Ala Val Thr Gly Lys Phe Thr Cys Gin Val Pro Gly Val Tyr Tyr 
35 40 45 

Phe Ala Val His Ala Thr Val Tyr Arg Ala Ser Leu Gin Phe Asp Leu 
50 55 60 

Val Lys Asn Gly Glu Ser He Ala Ser Phe Phe Gin Phe Phe Gly Gly 
65 70 75 80 

Trp Pro Lys Pro Ala Ser Leu Ser Gly Gly Ala Met Val Arg Leu Glu 
85 90 95 

Pro Glu Asp Gin Val Trp Val Gin Val Gly Val Gly Asp Tyr He Gly 
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100 105 110 

He Tyr Ala Ser He Lys Thr Asp Ser Thr Phe Ser Gly Phe Leu Val 
115 120 125 

<210> 8 

<211> 1263 

<212> DNA 

<213> Mus musculus 



<400> 8 

gtcgacccac gcgtccgcgc tgtgaagcca 
cagcaaggac aggggctgcc tgcctacaga 
ccggggtcac caccatgagg ccacttcttg 
ctcctcctct ggacgacaac aagatcccca 
gcacaccagg tcaccatggc agccaaggcc 
acggtgcacc cggagctccg ggagagaaag 
cacgtgggga gcccgggccg cgtggagagg 
gggagtgctc ggtaccccca cgatcagcct 
ctccgccagc cgacacaccc ctacctttcg 
acgaccccac tactggcaag ttcacctgcc 
acgccactgt ctaccgggcc agcttgcagt 
cctctttctt ccagtatttt ggggggtggc 
tggtaaggct agaacctgag gaccaggtgt 
gcatctatgc cagcatcaag acagacagta 
ggcacagctc cccagtcttc gcttaaaaca 
agtggagggt gtgacactaa cccgcgcagc 
ttgtgaatga cttaggaaga gagggagcca 
gacaggctgt ctgaggtcaa gacagcgtgg 
agaatgcagt aggctggcag ctgtgggtcc 
ttcctagtcc tgtgtcccct ctaggtccct 
ctttttctca gaggtcactt aataaaccta 
cgc 

<210> 9 

<211> 729 

<212> DNA . 

<213> Mus musculus 



gcaaggagca accagaagct aggagtcagt 60 
ctacaagaga ggttcctgga gtctgagcct 120 
cccttctgct tctgggtctg gtgtcaggct 180 
gcctgtgtcc cgggcagccc ggccttccag 24 0 
tgcctggccg tgacggccgt gatggccgcg 300 
gcgagggcgg gagaccggga ctacctggcc 360 
cagggcccat gggggctatc gggcctgcgg 420 
tcagtgccaa gcgatccgag agccgggtac 4 80 
accgtgtgct gctaaatgag cagggccatt 540 
aagtgcctgg cgtctactac tttgctgtgc 600 
ttgatcttgt caaaaacggg cagtccatcg 660 
ccaagccagc ctcgctctca gggggtgcga 720 
gggtgcaggt gggcgtgggt gattacattg 780 ■ 
ccttctctgg atttctcgtc tattctgact 840 
cagtgaaccc ggagctggca cttgctcctc 900 
gcataccagg agggctggcc ccctggaata 960 
cttccagtcc cactgctggc aatgaatgga 1020 
agcagtggct gggtttctgc ccaggacttt 1080 
tggcccagga ctccaaggtg ggatgctcca 1140 
gactccatct ctgctgctcc cagggcaggc 1200 
aaatcctcaa aaaaaaaaaa aaagggcggc 1260 

1263 



<400> 9 

atgaggccac ttcttgccct tctgcttctg 
gacaacaaga tccccagcct gtgtcccggg 
catggcagcc aaggcctgcc tggccgtgac 
gctccgggag agaaaggcga gggcgggaga 
gggccgcgtg gagaggcagg gcccatgggg 
cccccacgat cagccttcag tgccaagcga 
acacccctac ctttcgaccg tgtgctgcta 
ggcaagttca cctgccaagt gcctggcgtc 
cgggccagct tgcagtttga tcttgtcaaa 
tattttgggg ggtggcccaa gccagcctcg 
cctgaggacc aggtgtgggt gcaggtgggc 
atcaagacag acagtacctt ctctggattt 
gtcttcgct 

<210> 10 
<211> 243 
<212> PRT 

<213> Mus musculus 



ggtctggtgt caggctctcc tcctctggac 60 
cagcccggcc ttccaggcac accaggtcac 120 
ggccgtgatg gccgcgacgg tgcacccgga 180 
ccgggactac ctggcccacg tggggagccc 240 
gctatcgggc ctgcggggga gtgctcggta 300 
tccgagagcc gggtacctcc gccagccgac 360 
aatgagcagg gccattacga ccccactact 420 
tactactttg ctgtgcacgc cactgtctac 4 80 
aacgggcagt ccatcgcctc tttcttccag 540 
ctctcagggg gtgcgatggt aaggctagaa 600 
9tgggtgatt acattggcat ctatgccagc 660 
ctcgtctatt ctgactggca cagctcccca 720 

729 
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<400> 10 

Met Arg Pro Leu Leu Ala Leu Leu Leu Leu Gly Leu Val Ser Gly Ser 
1 5 10 15 

Pro Pro Leu Asp Asp Asn Lys lie Pro Ser Leu Cys Pro Gly Gin Pro 
20 25 30 

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gin Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu 
50 55 60 

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Glu Pro 
. 65 70 75 80 

Gly Pro Arg Gly Glu Ala Gly Pro Met Gly Ala lie Gly Pro Ala Gly 
85 90 95 

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu 
100 105 110 

Ser Arg Val Pro Pro Pro Ala Asp Thr Pro Leu Pro Phe Asp Arg Val 
115 120 125 

Leu Leu Asn Glu Gin Gly His Tyr Asp Pro Thr Thr Gly Lys Phe Thr 
130 135 140 

Cys Gin Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr 
145 . 150 155 160 

Arg Ala Ser Leu Gin Phe Asp Leu Val Lys Asn Gly Gin Ser He Ala 
165 170 175 

Ser Phe Phe Gin Tyr Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser 
180 185 190 

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gin Val Trp Val Gin 
195 200 205 

Val Gly Val Gly Asp Tyr He Gly He Tyr Ala Ser He Lys Thr Asp 
210 215 220 

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro 
225 230 235 240 

Val Phe Ala 



<210> 11 

<211> 228 

<212> PRT 

<213> Mus musculus 

<400> 11 

Ser Pro Pro Leu Asp Asp Asn Lys He Pro Ser Leu Cys Pro Gly Gin 
15 10 15 
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Pro Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gin Gly Leu Pro 
20 1 25 30 

Gly Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly 
35 40 45 

Glu Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Glu 
50 55 60 

Pro Gly Pro Arg Gly Glu Ala Gly Pro Met Gly Ala lie Gly Pro Ala 
65 70 75 80 

Gly Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser 
85 90 95 

Glu Ser Arg Val Pro Pro Pro Ala Asp Thr Pro Leu Pro Phe Asp Arg 
100 105 110 

Val Leu Leu Asn Glu Gin Gly His Tyr Asp Pro Thr Thr Gly Lys Phe 
115 120 125 

Thr Cys Gin Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val 
130 135 140 

Tyr Arg Ala Ser Leu Gin Phe Asp Leu Val Lys Asn Gly Gin Ser He 
145 150 155 160 

Ala Ser Phe Phe Gin Tyr Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu 
165 170 175 

Ser Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gin Val Trp Val 
180 185 190 

Gin Val Gly Val Gly Asp Tyr He Gly He Tyr Ala Ser He Lys Thr 
195 200 205 

Asp Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser 
210 215 220 

Pro Val Phe Ala 
225 



<210> 12 

<211> 15 

<212> PRT 

<213> Mus musculus 

<400> 12 

Met Arg Pro Leu Leu Ala Leu Leu Leu Leu Gly Leu Val Ser Gly 
15 10 15 



<210> 13 
<211> 60 
<212> PRT 

<213> Mus musculus 



7 



WO 00/78808 



PCT/US00/16883 



<400> 13 
Gly Thr Pro Gly 
1 

Arg Asp Gly Arg 
20 

Gly Gly Arg Pro 
35 

Gly Glu Ala Gly 
50 



His His Gly Ser 
5 

Asp Gly Ala Pro 

Gly Leu Pro Gly 
40 

Pro Met Gly Ala 
55 



Gin Gly Leu Pro 
10 

Gly Ala Pro Gly 
25 

Pro Arg Gly Glu 



He Gly Pro Ala 
60 



Gly Arg Asp Gly 
15 

Glu Lys Gly Glu 
30 

Pro Gly Pro Arg 
45 



<210> 14 

<211> 128 

<212> PRT 

<213> Mus musculus 

<400> 14 

Ala Phe Ser Ala Lys Arg Ser Glu Ser Arg Val Pro Pro Pro Ala Asp 
1 5 .10 15 

Thr Pro Leu Pro Phe Asp Arg Val Leu Leu Asn Glu Gin Gly His Tyr 
20 25 30 

Asp Pro Thr Thr Gly Lys Phe Thr Cys Gin Val Pro Gly Val Tyr Tyr 
35 40 45 

Phe Ala Val His Ala Thr Val Tyr Arg Ala Ser Leu Gin Phe Asp Leu 
50 55 60 



Val Lys Asn Gly Gin Ser He Ala 
65 70 

Trp Pro Lys Pro Ala Ser Leu Ser 
85 

Pro Glu Asp Gin Val Trp Val Gin 
100 

He Tyr Ala Ser He Lys Thr Asp 
115 120 



Ser Phe Phe Gin Tyr Phe Gly Gly 
75 80 

Gly Gly Ala Met Val Arg Leu Glu 
90 95 

Val Gly Val Gly Asp Tyr He Gly 
105 110 

Ser Thr Phe Ser Gly Phe Leu Val 
125 



<210> 15 

<211> 1831 

<212> DNA 

<213> Homo sapiens 



<400> 15 

gtcgacccac gcgtccgcgg acgcgtgggt gaggggaaga ggctgactgt acgttccttc 60 

tactctggca ccactctcca ggctgccatg gggcccagca cccctctcct catcttgttc 120 

cttttgtcat ggtcgggacc cctccaagga cagcagcacc accttgtgga gtacatggaa 180 

cgccgactag ctgctttaga ggaacggctg gcccagtgcc aggaccagag tagtcggcat 240 

gctgctgagc tgcgggactt caagaacaag atgctgccac tgctggaggt ggcagagaag 300 

gagcgggagg cactcagaac tgaggccgac' accatctccg ggagagtgga tcgtctggag 360 

cgggaggtag actatctgga gacccagaac ccagctctgc cctgtgtaga gtttgatgag 420 
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aaggtgactg gaggccctgg gaccaaaggc aagggaagaa ggaatgagaa gtacgatatg 480 
gtgacagact gtggctacac aatctctcaa gtgagatcaa tgaagattct gaagcgattt 540 
ggtggcccag ctggtctatg gaccaaggat ccactggggc aaacagagaa gatctacgtg 600 
ttagatggga cacagaatga cacagccttt gtcttcccaa ggctgcgtga cttcaccctt 660 
gccatggctg cccggaaagc ttcccgagtc cgggtgccct tcccctgggt aggcacaggg 720 
cagctggtat atggtggctt tctttatttt gctcggaggc ctcctggaag acctggtgga 780 
99tggtgaga tggagaacac tttgcagcta atcaaattcc acctggcaaa ccgaacagtg 840 
gtggacagct cagtattccc agcagagggg ctgatccccc cctacggctt gacagcagac 900 
acctacatcg acctggcagc tgatgaggaa ggtctttggg ctgtctatgc cacccgggag 960 
gatgacaggc acttgtgtct ggccaagtta gatccacaga cactggacac agagcagcag 1020 
tgggacacac catgtcccag agagaatgct gaggctgcct ttgtcatctg tgggaccctc 1080 
tatgtcgtct ataacacccg tcctgccagt cgggcccgca tccagtgctc ctttgatgcc 1140 
agcggcaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200 
catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatggctac 1260 
cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 
gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 
cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 
aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 
agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 
atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620 
gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 
gtccagctct gtcctctctt cctcactcct cccttcagtg tcctgaggaa caggactttc 1740 
tccacattgt tttgtattgc aacattttgc attaaaagga aaatccactg ctaaaaaaaa 1800 
aaaaaaaaaa aaaaaaaaaa agggcggccg c 1831 

<210> 16 

<211> 1218 

<212> DNA 

<213> Homo sapiens 



<400> 16 

atggggccca gcacccctct cctcatcttg 
ggacagcagc accaccttgt ggagtacatg 
ctggcccagt gccaggacca gagtagtcgg 
aagatgctgc cactgctgga ggtggcagag 
gacaccatct ccgggagagt ggatcgtctg 
aacccagctc tgccctgtgt agagtttgat 
ggcaagggaa gaaggaatga gaagtacgat 
caagtgagat caatgaagat tctgaagcga 
gatccactgg ggcaaacaga gaagatctac 
tttgtcttcc caaggctgcg tgacttcacc 
gtccgggtgc ccttcccctg ggtaggcaca 
tttgctcgga ggcctcctgg aagacctggt 
ctaatcaaat tccacctggc aaaccgaaca 
gggctgatcc ccccctacgg cttgacagca 
gaaggtcttt gggctgtcta tgccacccgg 
ttagatccac agacactgga cacagagcag 
gctgaggctg cctttgtcat ctgtgggacc 
agtcgggccc gcatccagtg ctcctttgat 
gcactccctt attttccccg cagatatggt 
gaacgccagc tctatgcctg ggatgatggc 
aagaaagagg aggaggtt 

<210> 17 
<211> 406 
<212> PRT 

<213> Homo sapiens 



ttccttttgt catggtcggg acccctccaa 60 
gaacgccgac tagctgcttt agaggaacgg 12 0 
catgctgctg agctgcggga cttcaagaac 180 
aaggagcggg aggcactcag aactgaggcc 240 
gagcgggagg tagactatct ggagacccag 300 
gagaaggtga ctggaggccc tgggaccaaa 360 
atggtgacag actgtggcta cacaatctct 420 
tttggtggcc cagctggtct atggaccaag 480 
gtgttagatg ggacacagaa tgacacagcc 540 
cttgccatgg ctgcccggaa agcttcccga 600 
gggcagctgg tatatggtgg ctttctttat 660 
ggaggtggtg agatggagaa cactttgcag 720 
9tggtggaca gctcagtatt cccagcagag 780 
gacacctaca tcgacctggc agctgatgag 840 
gaggatgaca ggcacttgtg tctggccaag 900 
cagtgggaca caccatgtcc cagagagaat 960 
ctctatgtcg tctataacac ccgtcctgcc 1020 
gccagcggca ccctgacccc tgaacgggca 1080 
gcccatgcca gcctccgcta taacccccga 1140 
taccagattg tctataagct ggagatgagg 1200 

1218 
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<400> 17 

Met Gly Pro Ser Thr Pro Leu Leu He Leu Phe Leu Leu Ser Trp Ser 
1 5 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 
20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 
35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 
50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Thr He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 
85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys 
100 105 110 

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys 
115 120 125 

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr He Ser Gin Val Arg Ser 
130 135 140 

Met Lys He Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Gin Thr Glu Lys He Tyr Val Leu Asp Gly Thr Gin 
165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 
180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val 
195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg 
210 215 220 

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gin 
225 230 235 240 

Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 
245 250 255 

Phe Pro Ala Glu Gly Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 
260 265 270 

Tyr He Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 
275 280 285 



Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 
290 295 300 
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Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 310 315 320 

Ala Glu Ala Ala Phe Val lie Cys Gly Thr Leu Tyr Val Val Tyr Asn 
325 330 335 

Thr Arg Pro Ala Ser Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser 
340 345 350 

Gly Thr Leu Thr Pro Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg 
355 360 365 

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu 
370 375 380 

Tyr Ala Trp Asp Asp Gly Tyr Gin He Val Tyr Lys Leu Glu Met Arg 
385 390 395 ** 400 

Lys Lys Glu Glu Glu Val 
405 



<210> 18 
<211> 385 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Gin Gin His His Leu Val Glu Tyr Met Glu Arg Arg Leu Ala Ala Leu 
15 10 15 

Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser Ser Arg His Ala Ala 
20 25 ^ 30 

Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro Leu Leu Glu Val Ala 
35 40 45 

Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala Asp Thr He Ser Gly 
50 55 60 

Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr Leu Glu Thr Gin Asn 
65 70 75 80 

Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys Val Thr Gly Gly Pro 
85 90 95 

Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys Tyr Asp Met Val Thr 
100 105 110 

Asp Cys Gly Tyr Thr He Ser Gin Val Arg Ser Met Lys He Leu Lys 
115 120 125 

Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys Asp Pro Leu Gly Gin 
130 135 140 

Thr Glu Lys He Tyr Val Leu Asp Gly Thr Gin Asn Asp Thr Ala Phe 
145 150 155 160 
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Val Phe Pro Arg Leu 
165 

Ala Ser Arg Val Arg 
180 

Val Tyr Gly Gly Phe 
195 

Gly Gly Gly Gly Glu 
210 

Leu Ala Asn Arg Thr 
225 

Leu lie Pro Pro Tyr 
245 

Ala Asp Glu Glu Gly 
260 

Arg His Leu Cys Leu 
275 

Gin Gin Trp Asp Thr 
290 



Arg Asp Phe Thr Leu Ala 
170 

Val Pro Phe Pro Trp Val 
185 

Leu Tyr Phe Ala Arg Arg 
200 

Met Glu Asn Thr Leu Gin 
215 

Val Val Asp Ser Ser Val 
230 235 

Gly Leu Thr Ala Asp Thr 
250 

Leu Trp Ala Val Tyr Ala 
265 

Ala Lys Leu Asp Pro Gin 
280 

Pro Cys Pro Arg Glu Asn 
295 



Met. Ala Ala Arg Lys 
175 

Gly Thr Gly Gin Leu 
190 

Pro Pro Gly Arg Pro 
205 

Leu lie Lys Phe His 
220 

Phe Pro Ala Glu Gly 
240 

Tyr lie Asp Leu Ala 
255 

Thr Arg Glu Asp Asp 
270 

Thr Leu Asp Thr Glu 
285 

Ala Glu Ala Ala Phe 
300 



Val He Cys Gly Thr Leu Tyr Val 
305 310 

Arg Ala Arg He Gin Cys Ser Phe 
325 

Glu Arg Ala Ala Leu Pro Tyr Phe 
340 

Ser Leu Arg Tyr Asn Pro Arg Glu 
355 360 

Gly Tyr Gin He Val Tyr Lys Leu 
370 375 

Val 
385 



Val Tyr Asn Thr Arg Pro Ala Ser 
315 320 

Asp Ala Ser Gly Thr Leu Thr Pro 
330 335 

Pro Arg. Arg Tyr Gly Ala His Ala 
345 350 

Arg Gin Leu Tyr Ala Trp Asp Asp 
365 

Glu Met Arg Lys Lys Glu Glu Glu 
380 



<210> 19 

<211> 21 

<212> PRT 

<213> Homo sapiens 



<400> 19 

Met Gly Pro Ser Thr Pro Leu Leu He Leu Phe Leu Leu Ser Trp Ser 
1 5 10 15 

Gly Pro Leu Gin Gly 
20 
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<210> 20 

<211> 244 

<212> PRT 

<213> Homo sapiens 

<400> 20 

Met Leu Leu Leu Gly Ala Val Leu Leu Leu Leu Ala Leu Pro Gly His 
1 5 10 15 

Asp Gin Glu Thr Thr Thr Gin Gly Pro Gly Val Leu Leu Pro Leu Pro 
20 25 30 

Lys Gly Ala Cys Thr Gly Trp Met Ala Gly He Pro Gly His Pro Gly 
35 40 45 

His Asn Gly Ala Pro Gly Arg Asp Gly Arg Asp Gly Thr Pro Gly Glu 
50 55 60 

Lys Gly Glu Lys Gly Asp Pro Gly Leu He Gly Pro Lys Gly Asp He 
65 70 75 80 

Gly Glu Thr Gly Val Pro Gly Ala Glu Gly Pro Arg Gly Phe Pro Gly 
85 90 95 

He Gin Gly Arg Lys Gly Glu Pro Gly Glu Gly Ala Tyr Val Tyr Arg 
100 105 * HO 

Ser Ala Phe Ser Val Gly Leu Glu Thr Tyr Val Thr He Pro Asn Met 
115 120 125 

Pro lie Arg Phe Thr Lys He Phe Tyr Asn Gin Gin Asn His Tyr Asp 
130 135 140 

Gly Ser Thr Gly Lys Phe His Cys Asn He Pro Gly Leu Tyr Tyr Phe 
145 150 155 160 

Ala Tyr His He Thr Val Tyr Met Lys Asp Val Lys Val Ser Leu Phe 
165 170 175 

Lys Lys Asp Lys Ala Met Leu Phe Thr Tyr Asp Gin Tyr Gin Glu Asn 
180 185 190 

Asn Val Asp Gin Ala Ser Gly Ser Val Leu Leu His Leu Glu Val Gly 
195 200 205 

Asp Gin Val Trp Leu Gin Val Tyr Gly Glu Gly Glu Arg Asn Gly Leu 
210 215 220 

Tyr Ala Asp Asn Asp Asn Asp Ser Thr Phe Thr Gly Phe Leu Leu Tyr 
225 230 235 240 

His Asp Thr Asn 



<210> 21 
<211> 1721 
<212> DNA 
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<213> Mus musculus 
<400> 21 

gtcgacccac gcgtccgact taaggctgcc atggggccca gtgctcctct gctgctcctc 60 
ttctttttgt catggacggg accccttcag ggacagcagc accaccttgt ggagtacatg 120 
gaacgccgac tagctgcctt agaggaacgg ctggcccaat gccaggatca gagtagtcgg 180 
catgctgccg agcttcggga cttcaaaaac aagatgttgc ctctcctgga ggtggcagag 24 0 
aaggagcggg agaccctcag aactgaagca gactccatct caggaagagt ggaccgtctt 3 00 
gaaagggagg tagactatct ggagacacag aacccagctt tgccctgtgt agagctggat 360 
gagaaggtga ctggaggtcc tggagccaaa ggcaagggcc gaagaaatga gaaatacgat 420 
atggtgacgg actgtagcta cacagtcgct caggtgaggt caatgaagat cctgaagcgg 4 80 
tttggtggtt cagttggcct atggaccaag gatccgctgg ggccagcaga gaagatctac 540 
gtgttagacg gcacccagaa cgacacggct tttgtcttcc caaggctgcg tgacttcacc 600 
cttgccatgg ctgcccggaa agcttcccga attcgggtgc ccttcccctg ggtaggcacg 660 
gggcagctgg tgtacggtgg cttcctttat tatgctcgaa ggcctcctgg aggacctgga 720 
gggggtggtg aattggagaa cactctgcag ctgatcaaat ttcacttggc aaaccgaaca 780 
gtggtggata gctcagtgtt ccctgcagag agcctgatac ccccctacgg cctgacagca 840 
gatacatata tcgacctggc agctgatgag gagggcctgt gggctgtcta tgccactcga 900 
gatgatgaca ggcatttgtg tctagccaag ttagacccac agacacttga cacagagcag 960 
cagtgggaca caccatgtcc cagagagaac gcagaggctg cgtttgtcat ctgtgggacc 1020 
ctgtacgttg tctataacac ccgccctgcc agtagggctc gtattcagtg ttccttcgat 1080 
gccagtggta ctctcgcccc tgaaagggca gcactctcct attttccacg ccgatatggt 1140 
gcccatgcca gccttcgcta taacccccgt gagcgccagc tgtatgcctg ggatgatggc 1200 
taccagattg tctacaaatt ggagatgaag aagaaggagg aggaagttta agcagctagc 1260 
cttgtgctct tgattcttat gcccagacat ttatattcct gtgagctctc ctgcagttca 1320 
tccttcaaaa cgaaggccag tggtggtagc tcatataccc taatttctaa aggacaacca 1380 
aattctcaag cccctctgtt ttatgcagaa ctccagatcc tgggtagcat tttagaactg 1440 
aacagcaaac aaacacccta aatcttcact cctgccttat gtccacaaag tttagttcca 1500 
aactcagagc cctgtccttt ggagagggtc aaccccagac agcaggcgac agcattcttg 1560 
ccctcagtat gaccgaaggg agagaactca gagacaaagc tgccctccct cccttccccc 1620 
tccagtgtag gggagaatgg ggctttcccc acatcacttt gtatggtaac agtttgcatt 1680 
aaaaggaaaa cccaccaaaa aaaaaaaaaa agggcggccg c 1721 

<210> 22 

<211> 1218 

<212> DNA 

<213> Mus musculus 

<400> 22 

atggggccca gtgctcctct gctgctcctc ttctttttgt catggacggg accccttcag 60 
ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcctt agaggaacgg 120 
ctggcccaat gccaggatca gagtagtcgg catgctgccg agcttcggga cttcaaaaac 180 
aagatgttgc ctctcctgga ggtggcagag aaggagcggg agaccctcag aactgaagca 240 
gactccatct caggaagagt ggaccgtctt gaaagggagg tagactatct ggagacacag 300 
aacccagctt tgccctgtgt agagctggat gagaaggtga ctggaggtcc tggagccaaa 360 
ggcaagggcc gaagaaatga gaaatacgat atggtgacgg actgtagcta cacagtcgct 420 
caggtgaggt caatgaagat cctgaagcgg tttggtggtt cagttggcct atggaccaag 4 80 
gatccgctgg ggccagcaga gaagatctac gtgttagacg gcacccagaa cgacacggct 540 
tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 
attcgggtgc ccttcccctg ggtaggcacg gggcagctgg tgtacggtgg cttcctttat 660 
tatgctcgaa ggcctcctgg aggacctgga gggggtggtg aattggagaa cactctgcag 720 
ctgatcaaat ttcacttggc aaaccgaaca gtggtggata gctcagtgtt ccctgcagag 780 
agcctgatac ccccctacgg cctgacagca gatacatata tcgacctggc agctgatgag 840 
gagggcctgt gggctgtcta tgccactcga gatgatgaca ggcatttgtg tctagccaag 900 
ttagacccac agacacttga cacagagcag cagtgggaca caccatgtcc cagagagaac 960 
gcagaggctg cgtttgtcat ctgtgggacc ctgtacgttg tctataacac ccgccctgcc 1020 
agtagggctc gtattcagtg ttccttcgat gccagtggta ctctcgcccc tgaaagggca 1080 
gcactctcct attttccacg ccgatatggt gcccatgcca gccttcgcta taacccccgt 1140 
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gagcgccagc tgtatgcctg ggatgatggc taccagattg tctacaaatt ggagatgaag 1200 
aagaaggagg aggaagtt 1218 

<210> 23 
<211> 406 
<212> PRT 

<213> Mus musculus 
<400> 23 

Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr 
15 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 
20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 
35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 
50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Ser He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 
85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys 
100 105 110 

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys 
115 120 125 

Tyr Asp Met Val Thr Asp Cys Ser Tyr Thr Val Ala Gin Val Arg Ser 
130 135 140 

Met Lys He Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys 
145 150 155 * 160 

Asp Pro Leu Gly Pro Ala Glu Lys He Tyr Val Leu Asp Gly Thr Gin 
165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 
180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg lie Arg Val Pro Phe Pro Trp Val 
195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg 
210 215 220 

Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gin 
225 230 235 240 

Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 
245 250 255 

Phe Pro Ala Glu Ser Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 
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260 



265 



270 



Tyr He Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 
275 280 285 

Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 
290 295 300 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 310 315 320 

Ala Glu Ala Ala Phe Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn 
325 330 335 

Thr Arg Pro Ala Ser Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser 
340 345 350 

Gly Thr Leu Ala Pro Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg 
355 360 365 

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu 
370 375 380 

Tyr Ala Trp Asp Asp Gly Tyr Gin He Val Tyr Lys Leu Glu Met Lys 
385 390 395 400 



Lys Lys Glu Glu Glu Val 
405 



<210> 24 

<211> 385 

<212> PRT 

<213> Mus musculus 

<400> 24 

Gin Gin His His Leu Val Glu Tyr Met Glu Arg Arg Leu Ala Ala Leu 
1 5 10 15 

Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser Ser Arg His Ala Ala 
20 25 30 

Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro Leu Leu Glu Val Ala 
35 40 45 

Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala Asp Ser He Ser Gly 
50 55 60 

Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr Leu Glu Thr Gin Asn 
65 70 75 80 

Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys Val Thr Gly Gly Pro 
85 90 95 

Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys Tyr Asp Met Val Thr 
100 105 110 

Asp Cys Ser Tyr Thr Val Ala Gin Val Arg Ser Met Lys He Leu Lys 
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115 120 125 

Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys Asp Pro Leu Gly Pro 
130 135 140 

Ala Glu Lys lie Tyr Val Leu Asp Gly Thr Gin Asn Asp Thr Ala Phe 
145 150 155 160 

Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala Met Ala Ala Arg Lys 
165 170 175 

Ala Ser Arg lie Arg Val Pro Phe Pro Trp Val Gly Thr Gly Gin Leu 
180 185 190 

Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg Pro Pro Gly Gly Pro 
195 200 205 

Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gin Leu lie Lys Phe His 
210 215 220 

Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val Phe Pro Ala Glu Ser 
225 230 235 240 

Leu lie Pro Pro Tyr Gly Leu Thr Ala Asp Thr Tyr lie Asp Leu Ala 
245 250 ~ 255 

Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala Thr Arg Asp Asp Asp 
260 265 • 270 

Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin Thr Leu Asp Thr Glu 
275 280 285 

Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn Ala Glu Ala Ala Phe 
290 295 300 

Val lie Cys Gly Thr Leu Tyr Val Val Tyr Asn Thr Arg Pro Ala Ser 
305 310 315 ~ 320 

Arg Ala Arg lie Gin Cys Ser Phe Asp Ala Ser Gly Thr Leu Ala Pro 
325 330 335 

Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg Tyr Gly Ala His Ala 
340 345 350 

Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu Tyr Ala Trp Asp Asp 
355 360 365 

Gly Tyr Gin lie Val Tyr Lys Leu Glu Met Lys Lys Lys Glu Glu Glu 
370 375 380 

Val 
385 



<210> 25 

<211> 21 

<212> PRT 

<213> Mus musculus 



17 



WO 00/78808 



PCT/US00/16883 



<400> 25 

Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr 
1 5 10 15 

Gly Pro Leu Gin Gly 
20 



<210> 26 

<211> 1869 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modif ied_base 

<222> all "n" positions 

<223> n«a, c, g, or t 

<400> 26 

gtcgacccac gcgtncntcc agcgtncgga gccgccctgg gtgtcagcgg ctcggctccc 60 
gcgcacgctc cggccgtcgc gcagcctcgg cacctgcagg tccgtgcgtc ccgcggctgg 120 
cgcccctgac tccgtcccgg ccagggaggg ccatgatttc cctcccgggg cccctggtga 180 
ccaacttgnt gcggtttttg ttcctggggc tgagtgccct cgcgcccccc tcgcgggccc 240 
agctgcaact gcacttgccc gccaaccggt tgcaggcggt ggaggagggg gaaagtggtg 300 
cttcagcatg gtacaccttg cacagggagg tgtcttcatc ccagccatgg gaggtgccct 360 
ttgtgatgtg gttcttcaaa cagaaagaaa aggaggatca ggtgttgtcc tacatcaatg 420 
gggtcacaac aagcaaacct ggagtatcct tggtctactc catgccctcc cggaacctgt 480 
ccctgcgggt ggagggtctc caggagaaag actctggccc ctacagctgc tccgtgaatg 540 
tgcaagacaa acaaggcaaa tctaggggcc acagcatcaa aaccttagaa ctcaatgtac 600 
tggttcctcc agctcctcca tcctgccgtc tccagggtgt gccccatgtg ggggcaaacg 660 
tgaccctgag ctgccagtct ccaaggagta agcccgctgt ccaataccag tgggatcggc 720 
agcttccatc cttccagact ttctttgcac cagcattaga tgtcatccgt gggtctttaa 780 
gcctcaccaa cctttcgtct tccatggctg gagtctatgt ctgcaaggcc cacaatgagg 840 
tgggcactgc ccaatgtaat gtgacgctgg aagtgagcac agggcctgga gctgcagtgg 900 
ttgctgaagc tgttgtgggt accctggttg gactggggtt gctggctggg ctggtcctct 960 
tgtaccaccg ccggggcaag gccctggagg agccagccaa tgatatcaag gaggatgcca 1020 
ttgctccccg gaccctgccc tggcccaaga gctcagacac aatctccaag aatgggaccc 1080 
tttcctctgt cacctccgca cgagccctcc ggccacccca tggccctccc aggcctggtg 1140 
cattgacccc cacgcccagt ctatccagcc aggccctgcc ctcaccaaga catgcccacg 1200 
acagatgggg cccaccctca accaatatcc cccatccctg gtggggtttt ttcctttggc 1260 
tttgagccgc atgggtgctg ngcctgtgat ggngcctgcc cagagtcaag ctggctctct 1320 
ggtatgatga ccccaccact cattggctaa aggatttggg gtctctcctt cctataaggg 1380 
tcacctctag cacagaggcc tgagtcatgg gaaagagtca cactcctgac ccttagtact 1440 
ctgcccccac ctctctttac tgtgggaaaa ccatctcagt aagacctaag tgtccaggag 1500 
acagaaggag aagaggaagt ggatctggaa ttgggaggag cctccaccca cccctgactc 1560 
ctccttatga agccagctgc tgaaattagc tactcaccaa gagtgagggg cagagacttc 1620 
cagtcactga gtctcccagg cccccttgat ctgtacccca cccctatcta acaccaccct 1680 
tggctcccac tccagctccc tgtattgata taacctgtca ggctggcttg gttaggtttt 1740 
actggggcag aggataggga atctcttatt aaaactaaca tgaaatatgt gttgttttca 1800 
tttgcaaatt taaataaaga tacataatgt ttgtatgaga taagaaaaaa aaaaaaaaag 1860 
ggcggccgc 1869 

<210> 27 

<211> 1110 

<212> DNA 

<213> Homo sapiens 

<220> 
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<221> modif iedjbase 
<222> all "n" positions 
<223> n=a ( c, g, or t 

<400> 27 

atgatttccc tcccggggcc cctggtgacc 
agtgccctcg cgcccccctc gcgggcccag 
caggcggtgg aggaggggga aagtggtgct 
tcttcatccc agccatggga ggtgcccttt 
gaggatcagg tgttgtccta catcaatggg 
gtctactcca tgccctcccg gaacctgtcc 
tctggcccct acagctgctc cgtgaatgtg 
agcatcaaaa ccttagaact caatgtactg 
cagggtgtgc cccatgtggg ggcaaacgtg 
cccgctgtcc aataccagtg ggatcggcag 
gcattagatg tcatccgtgg gtctttaagc 
gtctatgtct gcaaggccca caatgaggtg 
gtgagcacag ggcctggagc tgcagtggtt 
ctggggttgc tggctgggct ggtcctcttg 
ccagccaatg atatcaagga ggatgccatt 
tcagacacaa tctccaagaa tgggaccctt 
ccaccccatg gccctcccag gcctggtgca 
gccctgccct caccaagaca tgcccacgac 
catccctggt ggggtttttt cctttggctt 

<210> 28 
<211> 370 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid 
<400> 28 

Met He Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu 
15 10 15 

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala Gin Leu Gin 
20 25 30 

Leu His Leu Pro Ala Asn Arg Leu Gin Ala Val Glu Glu Gly Glu Ser 
35 40 45 

Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gin 
50 55 60 

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gin Lys Glu Lys 
65 70 75 80 

Glu Asp Gin Val Leu Ser Tyr lie Asn Gly Val Thr Thr Ser Lys Pro 
85 90 95 

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg 
100 105 110 

Val Glu Gly Leu Gin Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val 



aacttgntgc ggtttttgtt cctggggctg 60 
ctgcaactgc acttgcccgc caaccggttg 120 
tcagcatggt acaccttgca cagggaggtg 180 
gtgatgtggt tcttcaaaca gaaagaaaag 240 
gtcacaacaa gcaaacctgg agtatccttg 300 
ctgcgggtgg agggtctcca ggagaaagac 360 
caagacaaac aaggcaaatc taggggccac 420 
gttcctecag ctcctccatc ctgccgtctc 480 
accctgagct gccagtctcc aaggagtaag 540 
cttccatcct tccagacttt ctttgcacca 600 
ctcaccaacc tttcgtcttc catggctgga 660 
ggcactgccc aatgtaatgt gacgctggaa 720 
gctgaagctg ttgtgggtac cctggttgga 780 
taccaccgcc ggggcaaggc cctggaggag 840 
gctccccgga ccctgccctg gcccaagagc 900 
tcctctgtca cctccgcacg agccctccgg 960 
ttgaccccca cgcccagtct atccagccag 1020 
agatggggcc caccctcaac caatatcccc 1080 

1110 
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115 120 125 

Asn Val Gin Asp Lys Gin Gly Lys Ser Arg Gly His Ser lie Lys Thr 
130 135 140 

Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Leu 
145 150 155 160 

Gin Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gin Ser 
165 170 ' 175 

Pro Arg Ser Lys Pro Ala Val Gin Tyr Gin Trp Asp Arg Gin Leu Pro 
180 185 190 

Ser Phe Gin Thr Phe Phe Ala Pro Ala Leu Asp Val He Arg Gly Ser 
195 200 205 

Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr Val Cys 
210 215 220 

Lys Ala His Asn Glu Val Gly Thr Ala Gin Cys Asn Val Thr Leu Glu 
225 230 235 240 

Val Ser Thr Gly Pro Gly Ala Ala Val Val Ala Glu Ala Val Val Gly 
245 250 255 

Thr Leu Val Gly Leu Gly Leu Leu Ala Gly Leu Val Leu Leu Tyr His 
260 265 270 

Arg Arg Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp He Lys Glu Asp 
275 280 " 285 

Ala He Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser Asp Thr He 
290 295 300 

Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu Arg 
305 310 315 320 

Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro Thr Pro Ser 
325 330 335 

Leu Ser Ser Gin Ala Leu Pro Ser Pro Arg His Ala His Asp Arg Trp 
340 345 350 

Gly Pro Pro Ser Thr Asn He Pro His Pro Trp Trp Gly Phe Phe Leu 
355 360 365 

Trp Leu 
370 



<210> 29 
<211> 341 
<212> PRT 

<213> Mus musculus 
<400> 29 

Gin Leu Gin Leu His Leu Pro Ala Asn Arg Leu Gin Ala Val Glu Glu 
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10 



15 



Gly Glu Ser Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser 
20 25 ^ 30 

Ser Ser Gin Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gin 
35 40 45 

Lys Glu Lys Glu Asp Gin Val Leu Ser Tyr He Asn Gly Val Thr Thr 
50 55 60 

Ser Lys Pro Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu 
65 70 75 80 

Ser Leu Arg Val Glu Gly Leu Gin Glu Lys Asp Ser Gly Pro Tyr Ser 
85 90 95 

Cys Ser Val Asn Val Gin Asp Lys Gin Gly Lys Ser Arg Gly His Ser 
100 105 110 

He Lys Thr Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser 
115 120 125 

Cys Arg Leu Gin Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser 
130 135 140 

Cys Gin Ser Pro Arg Ser Lys Pro Ala Val Gin Tyr Gin Trp Asp Arg 
145 150 155 * 160 

Gin Leu Pro Ser Phe Gin Thr Phe Phe Ala Pro Ala Leu Asp Val He 
165 170 175 

Arg Gly Ser Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val 
180 185 190 

Tyr Val Cys Lys Ala His Asn Glu Val Gly Thr Ala Gin Cys Asn Val 
195 200 205 

Thr Leu Glu Val Ser Thr Gly Pro Gly Ala Ala Val Val Ala Glu Ala 
210 215 220 

Val Val Gly Thr Leu Val Gly Leu Gly Leu Leu Ala Gly Leu Val Leu 
225 230 235 240 

Leu Tyr His Arg Arg Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp He 
245 250 255 

Lys Glu Asp Ala He Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser 
260 265 270 

Asp Thr He Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg 
275 280 285 

Ala Leu Arg Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro 
290 295 300 



Thr Pro Ser Leu Ser Ser Gin Ala Leu Pro Ser Pro Arg His Ala His 
305 310 315 320 
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Asp Arg Trp Gly Pro Pro Ser Thr Asn lie Pro His Pro Trp Trp Gly 
325 330 335 

Phe Phe Leu Trp Leu 
340 



<210> 30 

<211> 29 

<212> PRT 

<213> Mus musculus 

<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid 
<400> 30 

Met He Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu 
15 10 15 

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala 
20 25 



<210> 31 

<211> 246 

<212> PRT 

<213> Mus musculus 

<220> 

<221> SITE 
<222> (13) 

<22 3> Xaa=unknown amino acid 
<400> 31 

Met He Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu 
1 5 10 15 

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala Gin Leu Gin 
20 25 30 

Leu His Leu Pro Ala Asn Arg Leu Gin Ala Val Glu Glu Gly Glu Ser 
35 40 45 

Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gin 
50 55 60 

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gin Lys Glu Lys 
65 70 75 80 

Glu Asp Gin Val Leu Ser Tyr He Asn Gly Val Thr Thr Ser Lys Pro 
85 90 95 

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg 
100 105 no 

Val Glu Gly Leu Gin Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val 
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115 120 125 

Asn Val Gin Asp Lys Gin Gly Lys Ser Arg Gly His Ser lie Lys Thr 
130 135 140 

Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Leu 
145 150 155 160 

Gin Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gin Ser 
165 170 175 

Pro Arg Ser Lys Pro Ala Val Gin Tyr Gin Trp Asp Arg Gin Leu Pro 
180 185 190 

Ser Phe Gin Thr Phe Phe Ala Pro Ala Leu Asp Val lie Arg Gly Ser 
195 200 205 

Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr Val Cys 
210 215 220 

Lys Ala His Asn Glu Val Gly Thr Ala Gin Cys Asn Val Thr Leu Glu 
225 230 235 240 

Val Ser Thr Gly Pro Gly 
245 



<210> 32 

<211> 653 

<212> DNA 

<213> Homo sapiens 

<400> 32 

ttttttgcat gtaacttttt tattgaggca caacaaggca ttgtaacttg cctggacttg 60" 
aggcagtcag tttagtaagc tgaacgttaa tacagttaag gattaagtgc aaacaatata 120 
cattcacagc ttgactagcg aggctacatc acaatttata aagtgccaga ttagtgctaa 180 
ttgtcattca gcttgatttt tcacctcagg aaggaaaaca aaaaagtaag gacctcctcc 240 
ctctaggaac aaaaaacatt ttcctaaacc aatcagtcat gagggcaaag actacttttc 3 00 
cttcaatccc actaattaga acaccatcct tttattgtca atactgtact gactttcaat 360 
cttgataaag aagatagcct gaaaacgtag aatatttcca gctacttcca taaattgctc 420 
ccctgtgcag acgtaaccat atctggtctc cctggaagag ctgaagaatt gcatgattgc 4 80 
tagcagtttc atggtctgga gcaccatcat tggcataggc tgataccaag acctcttcat 540 
tcttcantga ggttgacata cagtggcaca ttcactgcca gcttttacat gtgaaaaatg 600 
aaaaacgtag tgccattcac ttggcaatta aatctaccaa agctgagatc aaa . 653 

<210> 33 

<211> 25 

<212> PRT 

<213> Mus musculus 

<400> 33 

Ala Ala Val Val Ala Glu Ala Val Val Gly Thr Leu Val Gly Leu Gly 
15 10 15 " 

Leu Leu Ala Gly Leu Val Leu Leu Tyr 
20 25 
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<210> 34 

<211> 99 

<212> PRT 

<213> Mus musculus 

<400> 34 

His Arg Arg Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp He Lys Glu 
1 5 10 15 

Asp Ala He Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser Asp Thr 
20 25 30 

He Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu 
35 40 45 

Arg Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro Thr Pro 
50 55 60 

Ser Leu Ser Ser Gin Ala Leu Pro Ser Pro Arg His Ala His Asp Arg 
65 70 75 80 

Trp Gly Pro Pro Ser Thr Asn He Pro His Pro Trp Trp Gly Phe Phe 
85 90 95 

Leu Trp Leu 



<210> 35 

<211> 80 

<212> PRT 

<213> Mus musculus 

<400> 35 

Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gin 
1 5 10 15 

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gin Lys Glu Lys 
20 25 30 

Glu Asp Gin Val Leu Ser Tyr He Asn Gly Val Thr Thr Ser Lys Pro 
35 40 45 

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg 
50 55 60 

Val Glu Gly Leu Gin Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val 
65 70 75 80 



<210> 36 

<211> 60 

<212> PRT 

<213> Mus musculus 
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<400> 36 

Gly Ala Asn Val Thr Leu Ser Cys Gin Ser Pro Arg Ser Lys Pro Ala 
1 5 10 ^ 15 

Val Gin Tyr Gin Trp Asp Arg Gin Leu Pro Ser Phe Gin Thr Phe Phe 
20 25 30 

Ala Pro Ala Leu Asp Val lie Arg Gly Ser Leu Ser Leu Thr Asn Leu 
35 40 45 

Ser Ser Ser Met Ala Gly Val Tyr Val Cys Lys Ala 
50 55 60 



<210> 37 
<211> 1846 
<212> DNA 

<213> Mus musculus 
<400> 37 

gtcgacccac gcgtccggtg cacattcggg ttgccgccgc tcacccacaa cacctgtaga 60 
caccgtgtgt ccaactctcc ctgagtactc cgggccaagg agggccatga ttcttcaggc 120 
tggaaccccc gagaccagct tgctgcgggt tttgttcctg ggactgagta cccttgctgc 180 
cttctcccga gctcagatgg agttgcacgt gcccccgggc ctcaacaaat tggaagcggt 240 
agagggagaa gaagtggtgc tccccgcctg gtacacgatg gcacgggagg agtcgtggtc 300 
ccacccccgg gaggtgccca tcctgatctg gttcttggaa caagaaggga aggaaccaaa 360 
ccaggtgttg tcttacatta atggagtcat gacaaataaa cctggaacag ccctggtcca 420 
ctctatctct tcacggaatg tgtccctgcg cctgggggca ctccaggagg gagactctgg 480 
gacttaccgc tgttctgtca atgtgcagaa tgatgaaggc aaaagtatag gccacagcat 540 
caaaagcata gagctcaaag tgctggttcc tccagctcct ccatcctgta gtttacaggg 600 
tgtaccctat gtcgggacca atgtgaccct gaactgcaag tccccaagga gtaaacctac 660 
tgctcagtac cagtgggaga ggctggcccc atcctcccag gtcttctttg gaccagcctt 720 
agatgctgtt cgtggatctt taaagctcac taacctttcc attgccatgt ctggagtcta 780 
tgtctgcaag gctcaaaaca gagtgggctt tgccaagtgc aacgtgacct tggacgtgat 840 
gacagggtcc aaggctgcag tggtcgctgg agcagttgtg ggcacttttg ttgggttggt 900 
gctgatagct gggctggtcc tgttgtacca gcgccggagc aagaccttgg aagagctggc 960 
caatgatatc aaggaagatg ccattgctcc ccggaccttg ccttggacca aaggctcaga 1020 
cacaatctcc aagaatggga cactttcttc ggtcacctca gcacgagctc tgcggccacc 1080 
caaggctgct cctccaagac ctggcacatt tactcccaca cccagtgtct ctagccaggc 1140 
cctgtcctca ccaagactgc ccagggtaga tgaaccccca cctcaggcag tgtccctgac 1200 
cccaggtggg gtttcttctt ctgctctgag ccgcatgggt gctgtgcctg tgatggtgcc 1260 
tgcacagagt caggctgggt ctcttgtgtg atagcccagg cactcattag ctacatctgg 1320 
tatctgacct ttctgtaaag gtctccttgt ggcacagagg actcaatctt gggaggatgc 1380 
ccacattcta gacctccagt cctttgctcc tacctccttc tattgttgga atactgggcc 1440 
tcagtaagac taaaatctgg gtcaaaggac aaaaggagga aatggacctg aggtaggggg 1500 
ttgggagtga ggaggcttca cttcctccct gcttctccct gaagccagat gaatgctgcg 1560 
gaagatcggc taccctccaa gggctctgga ggagactgcc agtcagtgat gcccctggct 1620 
ctgtgatctg tacaacaccc ttatctaatg ctgtcctttg ccgttcgctc catctccctg 1680 
tattaatata acctgtcctg ctggcttggc tgggttttgt tgtagcaggg ggataggaaa 1740 
gacattttaa aatctgactt gaaattgatg tttttgtttt tattttgcaa atttcaataa 1800 
agatacatcg catttgcatg gaaaaaaaaa aaaaaagggc ggccgc 1846 

<210> 38 

<211> 1182 

<212> DNA 

<213> Mus musculus 

<400> 38 
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atgattcttc aggctggaac ccccgagacc agcttgctgc gggttttgtt cctgggactg 60 

agtacccttg ctgccttctc cegagctcag atggagttgc acgtgccccc gggcctcaac 120 

aaattggaag cggtagaggg agaagaagtg gtgctccccg cctggtacac gatggcacgg 180 

gaggagtcgt ggtcccaccc ccgggaggtg cccatcctga tctggttctt ggaacaagaa 24 0 

gggaaggaac caaaccaggt gttgtcttac attaatggag tcatgacaaa taaacctgga 300 

acagccctgg tccactctat ctcttcacgg aatgtgtccc tgcgcctggg ggcactccag 360 

gagggagact ctgggactta ccgctgttct gtcaatgtgc agaatgatga aggcaaaagt 420 

ataggccaca gcatcaaaag catagagctc aaagtgctgg ttcctccagc tcctccatcc 480 

tgtagtttac agggtgtacc ctatgtcggg accaatgtga ccctgaactg caagtcccca 540 

aggagtaaac ctactgctca gtaccagtgg gagaggctgg ccccatcctc ccaggtcttc 600 

tttggaccag ccttagatgc tgttcgtgga tctttaaagc tcactaacct ttccattgcc 660 

atgtctggag tctatgtctg caaggctcaa aacagagtgg gctttgccaa gtgcaacgtg 720 

accttggacg tgatgacagg gtccaaggct gcagtggtcg ctggagcagt tgtgggcact 780 

tttgttgggt tggtgctgat agctgggctg gtcctgttgt accagcgccg gagcaagacc 840 

ttggaagagc tggccaatga tatcaaggaa gatgccattg ctccccggac cttgccttgg 900 

accaaaggct cagacacaat ctccaagaat gggacacttt cttcggtcac ctcagcacga 960 

gctctgcggc cacccaaggc tgctcctcca agacctggca catttactcc cacacccagt 1020 

gtctctagcc aggccctgtc ctcaccaaga ctgcccaggg tagatgaacc cccacctcag 1080 

gcagtgtccc tgaccccagg tggggtttct tcttctgctc tgagccgcat gggtgctgtg 1140 

cctgtgatgg tgcctgcaca gagtcaggct gggtctcttg tg 1182 

<210> 39 

<211> 394 

<212> PRT 

<213> Mus musculus 

<400> 39 

Met lie Leu Gin Ala Gly Thr Pro Glu Thr Ser Leu Leu Arg Val Leu 
15 10 15 

Phe Leu Gly Leu Ser Thr Leu Ala Ala Phe Ser Arg Ala Gin Met Glu 
20 25 . " 30 

Leu His Val Pro Pro Gly Leu Asn Lys Leu Glu Ala Val Glu Gly Glu 
35 40 45 

Glu Val Val Leu Pro Ala Trp Tyr Thr Met Ala Arg Glu Glu Ser Trp 
50 55 60 

Ser His Pro Arg Glu Val Pro He Leu He Trp Phe Leu Glu Gin Glu 
65 70 75 80 

Gly Lys Glu Pro Asn Gin Val Leu Ser Tyr He Asn Gly Val Met Thr 
85 90 95 

Asn Lys Pro Gly Thr Ala Leu Val His Ser He Ser Ser Arg Asn Val 
100 105 110 

Ser Leu Arg Leu Gly Ala Leu Gin Glu Gly Asp Ser Gly Thr Tyr Arg 
115 120 125 

Cys Ser Val Asn Val Gin Asn Asp Glu Gly Lys Ser He Gly His Ser 
130 135 140 

He Lys Ser lie Glu Leu Lys Val Leu Val Pro Pro Ala Pro Pro Ser 
145 150 155 160 

Cys Ser Leu Gin Gly Val Pro Tyr Val Gly Thr Asn Val Thr Leu Asn 
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165 170 175 

Cys Lys Ser Pro Arg Ser Lys Pro Thr Ala Gin Tyr Gin Trp Glu Arg 
180 185 190 

Leu Ala Pro Ser Ser Gin Val Phe Phe Gly Pro Ala Leu Asp Ala Val 
195 200 205 

Arg Gly Ser Leu Lys Leu Thr Asn Leu Ser lie Ala Met Ser Gly Val 
210 215 220 

Tyr Val Cys Lys Ala Gin Asn Arg Val Gly Phe Ala Lys Cys Asn Val 
225 230 235 240 

Thr Leu Asp Val Met Thr Gly Ser Lys Ala Ala Val Val Ala Gly Ala 
245 250 255 

Val Val Gly Thr Phe Val Gly Leu Val Leu He Ala Gly Leu Val Leu 
260 265 270 

Leu Tyr Gin Arg Arg Ser Lys Thr Leu Glu Glu Leu Ala Asn Asp He 
275 280 285 

Lys Glu Asp Ala He Ala Pro Arg Thr Leu Pro Trp Thr Lys Gly Ser 
290 295 300 

Asp Thr He Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg 
305 310 315 320 

Ala Leu Arg Pro Pro Lys Ala Ala Pro Pro Arg Pro Gly Thr Phe Thr 
325 330 335 

Pro Thr Pro Ser Val Ser Ser Gin Ala Leu Ser Ser Pro Arg Leu Pro 
340 345 350 

Arg Val Asp Glu Pro Pro Pro Gin Ala Val Ser Leu Thr Pro Gly Gly 
355 360 365 

Val Ser Ser Ser Ala Leu Ser Arg Met Gly Ala Val Pro Val Met Val 
370 375 380 

Pro Ala Gin Ser Gin Ala Gly Ser Leu Val 
385 390 



<210> 40 
<211> 365 
<212> PRT 

<213> Mus musculus 
<400> 40 

Gin Met Glu Leu His Val Pro Pro Gly Leu Asn Lys Leu Glu Ala Val 
l 5 10 15 

Glu Gly Glu Glu Val Val Leu Pro Ala Trp Tyr Thr Met Ala Arg Glu 
20 25 30 

Glu Ser Trp Ser His Pro Arg Glu Val Pro He Leu He Trp Phe Leu 
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35 40 45 

Glu Gin Glu Gly Lys Glu Pro Asn Gin Val Leu Ser Tyr lie Asn Gly 
50 55 60 

Val Met Thr Asn Lys Pro Gly Thr Ala Leu Val His Ser lie Ser Ser 
65 70 75 80 

Arg Asn Val Ser Leu Arg Leu Gly Ala Leu Gin Glu Gly Asp Ser Gly 
85 90 ' 95 

Thr Tyr Arg Cys Ser Val Asn Val Gin Asn Asp Glu Gly Lys Ser lie 
100 105 110 

Gly His Ser lie Lys Ser He Glu Leu Lys Val Leu Val Pro Pro Ala 
115 120 125 

Pro Pro Ser Cys Ser Leu Gin Gly Val Pro Tyr Val Gly Thr Asn Val 
130 135 140 

Thr Leu Asn Cys Lys Ser Pro Arg Ser Lys Pro Thr Ala Gin Tyr Gin 
145 150 155 160 

Trp Glu Arg Leu Ala Pro Ser Ser Gin Val Phe Phe Gly Pro Ala Leu 
165 170 175 

Asp Ala Val Arg Gly Ser Leu Lys Leu Thr Asn Leu Ser He Ala Met 
180 185 190 

Ser Gly Val Tyr Val Cys Lys Ala Gin Asn Arg Val Gly Phe Ala Lys 
195 200 205 

Cys Asn Val Thr Leu Asp Val Met Thr Gly Ser Lys Ala Ala Val Val 
210 215 220 

Ala Gly Ala Val Val Gly Thr Phe Val Gly Leu Val Leu He Ala Gly 
225 230 235 240 

Leu Val Leu Leu Tyr Gin Arg Arg Ser Lys Thr Leu Glu Glu Leu Ala 
245 250 255 

Asn Asp He Lys. Glu Asp Ala He Ala Pro Arg Thr Leu Pro Trp Thr 
260 265 "* 270 

Lys Gly Ser Asp Thr He Ser Lys Asn Gly Thr Leu Ser Ser Val Thr 
275 280 285 

Ser Ala Arg Ala Leu Arg Pro Pro Lys Ala Ala Pro Pro Arg Pro Gly 
290 295 300 

Thr Phe Thr Pro Thr Pro Ser Val Ser Ser Gin Ala Leu Ser Ser Pro 
305 310 315 320 

Arg Leu Pro Arg Val Asp Glu Pro Pro Pro Gin Ala Val Ser Leu Thr 
325 330 335 

Pro Gly Gly Val Ser Ser Ser Ala Leu Ser Arg Met Gly Ala Val Pro 
340 345 350 
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Val Met Val Pro Ala Gin Ser Gin Ala Gly Ser Leu Val 
355 360 365 



<210> 41 
<211> 29 
<212> PRT 

<213> Mus musculus 
<400> 41 

Met lie Leu Gin Ala Gly Thr Pro Glu Thr Ser Leu Leu Arg Val Leu 
1 5 10 15 

Phe Leu Gly Leu Ser Thr Leu Ala Ala Phe Ser Arg Ala 
20 25 



<210> 42 

<211> 249 

<212> PRT 

<213> Mus musculus 

<400> 42 

Met He Leu Gin Ala Gly Thr Pro Glu Thr Ser Leu Leu Arg Val Leu 
1 5 10 15 

Phe Leu Gly Leu Ser Thr Leu Ala Ala Phe Ser Arg Ala Gin Met Glu 
20 25 30 

Leu His Val Pro Pro Gly Leu Asn Lys Leu Glu Ala Val Glu Gly Glu 
35 40 45 

Glu Val Val Leu Pro Ala Trp Tyr Thr Met Ala Arg Glu Glu Ser Trp 
50 .55 60 

Ser His Pro Arg Glu Val Pro He Leu He Trp Phe Leu Glu Gin Glu 
65 70 75 80 

Gly Lys Glu Pro Asn Gin Val Leu Ser Tyr He Asn Gly Val Met Thr 
85 90 95 

Asn Lys Pro Gly Thr Ala Leu Val His Ser He Ser Ser Arg Asn Val 
100 105 110 

Ser Leu Arg Leu Gly Ala Leu Gin Glu Gly Asp Ser Gly Thr Tyr Arg 
115 120 125 

Cys Ser Val Asn Val Gin Asn Asp Glu Gly Lys Ser He Gly His Ser 
130 135 140 

He Lys Ser He Glu Leu Lys Val Leu Val Pro Pro Ala Pro Pro Ser 
145 150 155 160 

Cys Ser Leu Gin Gly Val Pro Tyr Val Gly Thr Asn Val Thr Leu Asn 
165 170 175 

Cys Lys Ser Pro Arg Ser Lys Pro Thr Ala Gin Tyr Gin Trp Glu Arg 
180 185 190 



29 



WO 00/78808 



PCT/US00/16883 



Leu Ala Pro Ser Ser Gin Val Phe Phe Gly Pro Ala Leu Asp Ala Val 
195 200 205 

Arg Gly Ser Leu Lys Leu Thr Asn Leu Ser lie Ala Met Ser Gly Val 
210 215 220 

Tyr Val Cys Lys Ala Gin Asn Arg Val Gly Phe Ala Lys Cys Asn Val 
225 230 235 240 

Thr Leu Asp Val Met Thr Gly Ser Lys 
245 



<210> 43 

<211> 355 

<212> PRT 

<213> Mus musculus 



<220> 

<221> SITE 
<222> (355) 

<223> Xaa=unknown amino acid 
<400> 43 

Met Gly Pro Ser Thr Pro Leu Leu He Leu Phe Leu Leu Ser Trp Ser 
1 5 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 
20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 
35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 
50 55 60 



Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Thr He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 
85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys 
100 105 110 

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys 
115 120 125 

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr lie Ser Gin Val Arg Ser 
130 135 140 

Met Lys He Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Gin Thr Glu Lys ile Tyr Val Leu Asp Gly Thr Gin 
165 170 175 



Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 
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180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val 
195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg 
210 215 220 

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gin 
225 230 235 240 

Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 
245 250 * 255 

Phe Pro Ala Glu Gly Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 
260 265 270 

Tyr He Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 
275 280 ~ 285 

Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 
290 295 300 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 310 315 320 

Ala Glu Ala Ala Phe Val lie Cys Gly Thr Leu Tyr Val Val Tyr Asn 
325 330 335 

Thr Arg Pro Ala Ser Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser 
340 345 350 

Gly Pro Xaa 
355 



<210> 44 
<211> 25 
<212> PRT 

<213> Mus musculus 
<400> 44 

Ala Ala Val Val Ala Gly Ala Val Val Gly Thr Phe Val Gly Leu Val 
1 5 10 15 

Leu He Ala Gly Leu Val Leu Leu Tyr 
20 25 



<210> 45 
<211> 120 
<212> PRT 

<213> Mus musculus 
<400> 45 

Gin Arg Arg Ser Lys Thr Leu Glu Glu Leu Ala Asn Asp He Lys Glu 
15 10 15 
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Asp Ala lie Ala Pro Arg Thr Leu Pro Trp Thr Lys Gly Ser Asp Thr 
20 25 30 

lie Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu 
35 . 40 45 

Arg Pro Pro Lys Ala Ala Pro Pro Arg Pro Gly Thr Phe Thr Pro Thr 
50 55 60 

Pro Ser Val Ser Ser Gin Ala Leu Ser Ser Pro Arg Leu Pro Arg Val 
65 70 75 80 

Asp Glu Pro Pro Pro Gin Ala Val Ser Leu Thr Pro Gly Gly Val Ser 
85 90 95 

Ser Ser Ala Leu Ser Arg Met Gly Ala Val Pro Val Met Val Pro Ala 
100 105 110 



Gin Ser Gin Ala Gly Ser Leu Val 
115 120 



<210> 46 

<211> 1801 

<212> DNA 

<213> Homo sapiens 

<400> 46 

gtcgacccac gcgtccggcg gaggttgtgg 
cttgatgcgt ctgtttgtcc gtccgtccgt 
tccgtggacc ctttcgctgc tgctgttgtt 
gctgtgcagg cccttcggtg aagacaattc 
tggctcctgt tccagccaat actgctgctc 
ggaaatgtgc cctgagccag agtccagcag 
gctgggttca gcgctgaagt atcagtccag 
agcgaccgtg gccatcggcc tgaccgtctt 
ctttacctgc tcctgctgct gtctatataa 
caacaccaca actactaccg tggttcacac 
cagctatcct ggaccaacat accagggcta 
agcagcaccc tacccaacgc agtaccctcc 
agcctatcat gagacgttgg ctggagccag 
tcccccaaag gcagttccct gagcctgccc 
catgtgtgtg tgagtgctat gcagagttct 
ctagacatgt ggcttcctct gctgatgacc 
gggaccaatc tgttttcttc ctcacttgaa 
aaaacaatag ggtaggaggt atttcccgct 
acacatagga gagcaagctt tttgtgggtc 
ctgctgctat gggtttattc ccagggcttg 
cctcagtggc agtgggtcag tgactgatgt 
tctccatcag ctgtctgtct ggacggtccc 
acatgtattc actattcagg ctccagtggc 
actctggttt ctccctacag tgtcttttta 
tatccagatg tgtgataatt ggtgaggttg 
tgacctctga cagtccgttt cccttgacac 
acagtccagt ttgttccagt agcagggaca 
gggggtggag agcctggatg gtagctctgg 
tggttcctgt ttcactggct gttttagttt 
ctccgcaccc tcgtttataa taaatgaata 



ctgcaccgtg gtcctgggct tggtcctggg 60 
ccgtcccgcc atggctgcgc cggcgccctc 120 
gctactgccg tctccgggtg cccatggcga 180 
gatcccagag tcctgtcctg acttctgttg 240 
tgacgtgctg aagaaaatcc agtggaatga 300 
attttccgcc cacccggaga caccagaaca 360 
tcttgacagt gacaacatgc cagggttcgg 420 
cgtggtgttt atcgctacca tcattgtgtg 4 80 
gatgtgctgc cgcccacgac ctgtcgtgtc 540 
cgcttaccct cagcctcaac ctgtggcccc 600 
ccatcccatg cccccccagc caggaatgcc 660 
accctacctg gcccagccca cagggccacc 720 
ccagcctcca tacaacccgg cctacatgga 780 
ccagcctctt tggctaacat ttgattatgt 840 
ttactgctgt ctgtggtgcg tgtgccttgt 900 
aggtaggcac aaatcttacc agtgctggtt 960 
attgtaattt ctgaaatttc aagtaaatta 1020 
tcaccccaag gtgaccagcc atagcctgcc 1080 
catgtcctgc tttggggagt agccagctag 1140 
gctgcattta gctggacaga gaacaagggg 1200 
cagagcacac taggcagaga gccccgtccg 1260 
actgtctttc ctgggactat gtagagggcc 1320 
ttccaggcca ggggcctctg tctactacac 1380 
cgattagcca aacatattgc ctgttttttg 1440 
aaatccttgg ttcctggaga acaggaaacc 1500 
cagcttcata gaatacctga ctcctgtact 1560 
ccagggccag gggttatctg gaccaagggt 1620 
accagatgtg aatgcctcca tattccctgt 1680 
tgtgttaatt ggtgtttctg agcattcaaa 1740 
tttggaaaaa aaaaaaaaaa aaaaaaaaaa 1800 
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a 1801 

<210> 47 
<211> 735 
<212> DNA 

<213> Homo sapiens 
<400> 47 

atgcgtctgt ttgtccgtcc gtccgtccgt cccgccatgg ctgcgccggc gccctctccg 60 

tggacccttt cgctgctgct gttgttgcta ctgccgtctc cgggtgccca tggcgagctg 120 

tgcaggccct tcggtgaaga caattcgatc ccagagtcct gtcctgactt ctgttgtggc 180 

tcctgttcca gccaatactg ctgctctgac gtgctgaaga aaatccagtg gaatgaggaa 240 

atgtgccctg agccagagtc cagcagattt tccgcccacc cggagacacc agaacagctg 300 

ggttcagcgc tgaagtatca gtccagtctt gacagtgaca acatgccagg gttcggagcg 360 

accgtggcca tcggcctgac cgtcttcgtg gtgtttatcg ctaccatcat tgtgtgcttt 420 

acctgctcct gctgctgtct atataagatg tgctgccgcc cacgacctgt cgtgtccaac 480 

accacaacta ctaccgtggt tcacaccgct taccctcagc ctcaacctgt ggcccccagc 540 

tatcctggac caacatacca gggctaccat cccatgcccc cccagccagg aatgccagca 600 

gcaccctacc caacgcagta ccctccaccc tacctggccc agcccacagg gccaccagcc 660 

tatcatgaga cgttggctgg agccagccag cctccataca acccggccta catggatccc 720 

ccaaaggcag ttccc 735 

<210> 48 
<211> 245 
<212> PRT 

<213> Homo sapiens 
<400> 48 

Met Arg Leu Phe Val Arg Pro Ser Val Arg Pro Ala Met Ala Ala Pro 
1 5 10 15 

Ala Pro Ser Pro Trp Thr Leu Ser Leu Leu Leu Leu Leu Leu Leu Pro 
20 25 30 

Ser Pro Gly Ala His Gly Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn 
35 40 45 

Ser lie Pro Glu Ser Cys Pro Asp Phe Cys Cys Gly Ser Cys Ser Ser 
50 55 60 

Gin Tyr Cys Cys Ser Asp Val Leu Lys Lys He Gin Trp Asn Glu Glu 
65 70 75 80 

Met Cys Pro Glu Pro Glu Ser Ser Arg Phe Ser Ala His Pro Glu Thr 
85 90 95 

Pro Glu Gin Leu Gly Ser Ala Leu Lys Tyr Gin Ser Ser Leu Asp Ser 
100 105 110 

Asp Asn Met Pro Gly Phe Gly Ala Thr Val Ala He Gly Leu Thr Val 
115 120 125 

Phe Val Val Phe He Ala Thr He lie Val Cys Phe Thr Cys Ser Cys 
130 135 140 

Cys Cys Leu Tyr Lys Met Cys Cys Arg Pro Arg Pro Val Val Ser Asn 
145 150 155 160 
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Thr Thr Thr Thr Thr Val Val His Thr Ala Tyr Pro Gin Pro Gin Pro 
165 170 175 

Val Ala Pro Ser Tyr Pro Gly Pro Thr Tyr Gin Gly Tyr His Pro Met 
180 185 ' 190 

Pro Pro Gin Pro Gly Met Pro Ala Ala Pro Tyr Pro Thr Gin Tyr Pro 
195 200 205 

Pro Pro Tyr Leu Ala Gin Pro Thr Gly Pro Pro Ala Tyr His Glu Thr 
210 215 220 

Leu Ala Gly Ala Ser Gin Pro Pro Tyr Asn Pro Ala Tyr Met Asp Pro 
225 230 235 240 

Pro Lys Ala Val. Pro 
245 



<210> 49 

<211> 38 

<212> PRT 

<213> Homo sapiens 

<400> 49 

Met Arg Leu Phe Val Arg Pro Ser Val Arg Pro Ala Met Ala Ala Pro 
15 10 15 

Ala Pro Ser Pro Trp Thr Leu Ser Leu Leu Leu Leu Leu Leu Leu Pro 
20 25 30 

Ser Pro Gly Ala His Gly 

3.5 



<210> 50 

<211> 207 

<212> PRT 

<213> Homo sapiens 

<400> 50 

Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn Ser He Pro Glu Ser Cys 
1 5 10 15 

Pro Asp Phe Cys Cys Gly Ser Cys Ser Ser Gin Tyr Cys Cys Ser Asp 
20 25 " 30 

Val Leu Lys Lys He Gin Trp Asn Glu Glu Met Cys Pro Glu Pro Glu 
35 40 "* 45 

Ser Ser Arg Phe Ser Ala His Pro Glu Thr Pro Glu Gin Leu Gly Ser 
50 55 60 

Ala Leu Lys Tyr Gin Ser Ser Leu Asp Ser Asp Asn Met Pro Gly Phe 
65 70 75 80 

Gly Ala Thr Val Ala He Gly Leu Thr Val Phe Val Val Phe He Ala 
85 90 95 
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Thr He He Val Cys Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys Met 
100 105 HO 

Cys Cys Arg Pro Arg Pro Val Val Ser Asn Thr Thr Thr Thr Thr Val 
115 120 125 

Val His Thr Ala Tyr Pro Gin Pro Gin Pro Val Ala Pro Ser Tyr Pro 
130 135 140 

Gly Pro Thr Tyr Gin Gly Tyr His Pro Met Pro Pro Gin Pro Gly Met 
145 150 155 160 

Pro Ala Ala Pro Tyr Pro Thr Gin Tyr Pro Pro Pro Tyr Leu Ala Gin 
165 170 " 175 

Pro Thr Gly Pro Pro Ala Tyr His Glu Thr Leu Ala Gly Ala Ser Gin 
180 185 190 

Pro Pro Tyr Asn Pro Ala Tyr Met Asp Pro Pro Lys Ala Val Pro 
195 200 205 



<210> 51 

<211> 85 

<212> PRT 

<213> Homo sapiens 

<400> 51 

Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn Ser He Pro Glu Ser Cys 
1 5 10 15 

Pro Asp Phe Cys Cys Gly Ser Cys Ser Ser Gin Tyr Cys Cys Ser Asp 
20 25 30 

Val Leu Lys Lys He Gin Trp Asn Glu Glu Met Cys Pro Glu Pro Glu 
35 40 45 

Ser Ser Arg Phe Ser Ala His Pro Glu Thr Pro Glu Gin Leu Gly Ser 
50 55 60 

Ala Leu Lys Tyr Gin Ser Ser Leu Asp Ser Asp Asn Met Pro Gly Phe 
65 70 75 80 

Gly Ala Thr Val Ala 
85 



<210> 52 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 52 

He Gly Leu Thr Val Phe Val Val Phe He Ala Thr He He Val Cys 
1 5 io 15 

Phe Thr Cys Ser Cys Cys Cys Leu Tyr 
20 " 25 
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<210> 53 
<211> 97 
<212> PRT 

<213> Homo sapiens 
<400> -53 

Lys Met Cys Cys Arg Pro Arg Pro Val Val Ser Asn Thr Thr Thr Thr 
1 5 10 15 

Thr Val Val His Thr Ala Tyr Pro Gin Pro Gin Pro Val Ala Pro Ser 
20 25 30 

Tyr Pro Gly Pro Thr Tyr Gin Gly Tyr His Pro Met Pro Pro Gin Pro 
35 40 45 

Gly Met Pro Ala Ala Pro Tyr Pro Thr Gin Tyr Pro Pro Pro Tyr Leu 
50 55 60 

Ala Gin Pro Thr Gly Pro Pro Ala Tyr His Glu Thr Leu Ala Gly Ala 
65 70 75 80 

Ser Gin Pro Pro Tyr Asn Pro Ala Tyr Met Asp Pro Pro Lys Ala Val 
85 90 95 

Pro 



<210> 54 

<211> 50 

<212> PRT 

<213> Homo sapiens 

<400> 54 

Cys Arg Pro Phe Gly Glu Asp Asn Ser lie Pro Glu Ser Cys Pro Asp 
1 5 10 15 

Phe Cys Cys Gly Ser Cys Ser Ser Gin Tyr Cys Cys Ser Asp Val Leu 
20 25 30 

Lys Lys lie Gin Trp Asn Glu Glu Met Cys Pro Glu Pro Glu Ser Ser 
35 40 . 45 

Arg Phe 
50 



<210> 55 

<211> 56 

<212> PRT 

<213> Homo sapiens 



<400> 55 

Thr Val Phe Val Val Phe lie Ala Thr lie He Val Cys Phe Thr Cys 
1 5 10 15 

Ser Cys Cys Cys Leu Tyr Lys Met Cys Cys Arg Pro Arg Pro Val Val 
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20 



25 



30 



Ser Asn Thr Thr Thr Thr Thr Val Val His Thr Ala Tyr Pro Gin Pro 
35 40 45 



Gin Pro Val Ala Pro Ser Tyr Pro 
50 55 



<210> 56 

<211> 1858 

<212> DNA 

<213> Mus musculus 

<400> 56 

gtcgacccac gcgtccgcgc ggaggttgcg 
tgttcgtccg tccgttggtc tgtcccgcca 
tattgctgct gctgttgctg ctgccgccgc 
cctttggtga agacaattcg atcccagtgt 
ccaaccaata ctgctgctcg gacgtgctga 
ctgagccaga gtccagcaga ttttccaccc 
cagcgctgaa atttcgatcc agttttgaca 
tcgccattgg cgtgaccatc tttgtggtgt 
gctcctgctg ctgtctgtat aagatgtgct 
caactactac cgtggttcat gccccttacc 
gctatcctgg accaacatac cagggctacc 
gcagcaccct acccaacgca gtacccacca 
ccctaccatg agtccttggc tggagccagc 
tccctaaaga caattccctg aacctgcccc 
gtgagtgagt gatacgcaga gttctttact 
catgtggctt cctctgctgt tgaccaggta 
caacctgttt tcttcctcac ttgaaattgt 
aataaggtag gaggtatttc ccacgtcacc 
taggagagca agctttttgc gggtacagag 
gctaggcctt tattcccagg gtttggctgc 
caccaggtga caaggggact cagtggcagg 
tccatcagct gtctgtctgg atgtcactgt 
tgttcactat tcaggctcca ctgggggaat 
cccaggccag ggacctcggt ctgtctacta 
ttactgttag ccaaacattt tgcctgtttt 
ggttgaaatc cctggttcct ggaggacaga 
gacaccatct tcatagaaat acctgactcc 
gggacaccaa ggccaatggg ttatctggac 
ctccggccca gatgtgaata cctccatatt 
tagctttgtg ttgattggtg tttctgagca 
tgcaacattg gaaaaaaaaa aaaaaaaaaa 

<210> 57 

<211> 639 

<212> DNA 

<213> Mus musculus 



gcggcaccgt ggtcttgggc ttggtccgtc 60 
tggctgcgcc ggcgccctct ctgtggaccc 120 
ctccgggtgc ccatggtgag ctgtgcaggc. 180 
tctgtcctga tttctgttgt ggttcctgtt 240 
ggaaaatcca gtggaatgag gaaatgtgtc 300 
ccgcggagga gacacccgaa catctgggtt 360 
gtgaccctat gtcagggttc ggagcgaccg 420 
ttattgccac tatcatcatc tgcttcacct 480 
gcccccaacg ccctgtcgtg accaacacca 540 
ctcagcctca acctcaacct gtggccccca 600 
atcccatgcc ccccccagcc aggaatgcca 660 
ccctacctgg cccagcccac agggccgcca 720 
cagcctccat acaacccgac ctacatggat 780 
cagcctcttt ggctgccatt tatgtcgtgt 840 
gctgtctgtg gtgtgtgtgc cttgtctaga 900 
ggcgcaagtc ttaccagtgt gggtcgggac 960 
actttctgaa atttcaagca aattaaaaac 1020 
ccaaggtgac cagccatggc ctgtcatact 1080 
caggctttgg ggggtaacca gctagctgct 1140 
attggcagtg aggcaggtgg ctgggggtga 1200 
gggtcacacc aggcagaaca ccatacactc 1260 
ccttcccggg gctgtataga gggccacatg 1320 
tttcctacct ttgctggctt ggctcctgct 1380 
cacactctgg tttctccctg cactgtcttt 1440 
ctgtctccag atgtgtgata attggtgtga 1500 
caacctgacc tccgactgtc agtttccctt 1560 
tgtaccacag tccagtttgt cccagtagca 1620 
caaaggtggg gtggagggcc tagatggtat 1680 
ccctgttggt tcctgtttca ctggctgttt 1740 
ttcagactcc gcaccctcat ttctaataaa 1800 
aaaaaaaaaa aaaaaaaagg gcggccgc 1858 



<400> 57 

atggctgcgc cggcgccctc tctgtggacc 
cctccgggtg cccatggtga gctgtgcagg 
ttctgtcctg atttctgttg tggttcctgt 
aggaaaatcc agtggaatga ggaaatgtgt 
cccgcggagg agacacccga acatctgggt 



ctattgctgc tgctgttgct gctgccgccg 60 

ccctttggtg aagacaattc gatcccagtg 120 

tccaaccaat actgctgctc ggacgtgctg 180 

cctgagccag agtccagcag attttccacc 240 

tcagcgctga aatttcgatc cagttttgac 300 
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agtgacccta tgtcagggtt cggagcgacc gtcgccattg gcgtgaccat ctttgtggtg 360 
tttattgcca ctatcatcat ctgcttcacc tgctcctgct gctgtctgta taagatgtgc 420 
tgcccccaac gccctgtcgt gaccaacacc acaactacta ccgtggttca tgccccttac 4 80 
cctcagcctc aacctcaacc tgtggccccc agctatcctg gaccaacata ccagggctac 540 
catcccatgc cccccccagc caggaatgcc agcagcaccc tacccaacgc agtacccacc 600 
accctacctg gcccagccca cagggccgcc accctacca 63 9 

<210> 58 

<211> 213 

<212> PRT 

<213> Mus musculus 

<400> 58 

Met Ala Ala Pro Ala Pro Ser Leu Trp Thr Leu Leu Leu Leu Leu Leu 
15 10 15 

Leu Leu Pro Pro Pro Pro Gly Ala His Gly Glu Leu Cys Arg Pro Phe 
20 25 * 30 

Gly Glu Asp Asn Ser lie Pro Val Phe Cys Pro Asp Phe Cys Cys Gly 
35 40 45 

Ser Cys Ser Asn Gin Tyr Cys Cys Ser Asp Val Leu Arg Lys He Gin 
50 55 60 



Trp Asn Glu Glu Met Cys Pro Glu Pro Glu Ser Ser Arg Phe Ser Thr 
65 70 75 80 

Pro Ala Glu Glu Thr Pro Glu His Leu Gly Ser Ala Leu Lys Phe Arg 
85 90 95 

Ser Ser Phe Asp Ser Asp Pro Met Ser Gly Phe Gly Ala Thr Val Ala 
100 105 110 

He Gly Val Thr He Phe Val Val Phe He Ala Thr He He He Cys 
115 120 125 

Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys Met Cys Cys Pro Gin Arg 
130 135 140 

Pro Val Val Thr Asn Thr Thr Thr Thr Thr Val Val His Ala Pro Tyr 
145 150 155 160 

Pro Gin Pro Gin Pro Gin Pro Val Ala Pro Ser Tyr Pro Gly Pro Thr 
165 170 175 



Tyr Gin Gly Tyr His Pro Met Pro Pro Pro Ala Arg Asn Ala Ser Ser 
180 185 190 

Thr Leu Pro Asn Ala Val Pro Thr Thr Leu Pro Gly Pro Ala His Arg 
195 200 205 

Ala Ala Thr Leu Pro 
210 



<210> 59 
<211> 26 
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<212> PRT 

<213> Mus musculus 

<400> 59 

Met Ala Ala Pro Ala Pro Ser Leu 
1 5 

Leu Leu Pro Pro Pro Pro Gly Ala 
20 



Trp Thr Leu Leu Leu Leii Leu Leu 
10 15 

His Gly 
25 



<210> 60 

<211> 187 

<212> PRT 

<213> Mus musculus 

<400> 60 

Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn Ser lie Pro Val Phe Cys 
15 10 15 

Pro Asp Phe Cys Cys Gly Ser Cys Ser Asn Gin Tyr Cys Cys Ser Asp 
20 25 30 

Val Leu Arg Lys lie Gin Trp Asn Glu Glu Met Cys Pro Glu Pro Glu 
35 40 45 

Ser Ser Arg Phe Ser Thr Pro Ala Glu Glu Thr Pro Glu His Leu Gly 
50 55 60 

Ser Ala Leu Lys Phe Arg Ser Ser Phe Asp Ser Asp Pro Met Ser Gly 
65 70 75 80 

Phe Gly Ala Thr Val Ala lie Gly Val Thr He Phe Val Val Phe He 
85 90 95 

Ala Thr He lie He Cys Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys 
100 105 110 

Met Cys Cys Pro Gin Arg Pro Val Val Thr Asn Thr Thr Thr Thr Thr 
115 120 125 

Val Val His Ala Pro Tyr Pro Gin Pro Gin Pro Gin Pro Val Ala Pro 
130 135 140 

Ser Tyr Pro Gly Pro Thr Tyr Gin Gly Tyr His Pro Met Pro Pro Pro 
145 150 155 160 

Ala Arg Asn Ala Ser Ser Thr Leu Pro Asn Ala Val Pro Thr Thr Leu 
165 170 175 

Pro Gly Pro Ala His Arg Ala Ala Thr Leu Pro 
180 185 



<210> 61 

<211> 86 

<212> PRT 

<213> Mus musculus 
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<400> 61 

Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn Ser lie Pro Val Phe Cys 
1 5 -10 15 

Pro Asp Phe Cys Cys Gly Ser Cys Ser Asn Gin Tyr Cys Cys Ser Asp 
20 25 30 

Val Leu Arg Lys lie Gin Trp Asn Glu Glu Met Cys Pro Glu Pro Glu 
35 40 45 

Ser Ser Arg Phe Ser Thr Pro Ala Glu Glu Thr Pro Glu His Leu Gly 
50 55 60 

Ser Ala Leu Lys Phe Arg Ser Ser Phe Asp Ser Asp Pro Met Ser Gly 
65 70 75 80 

Phe Gly Ala Thr Val Ala 
85 



<210> 62 

<211> 25 

<212> PRT 

<213> Mus musculus 



<400> 62 

He Gly Val Thr He Phe Val Val Phe He Ala Thr He He He Cys 
15 10 15 

Phe Thr Cys Ser Cys Cys Cys Leu Tyr 
20 25 



<210> 63 

<211> 76 

<212> PRT 

<213> Mus musculus 



<400> 63 

Lys Met Cys Cys Pro Gin Arg Pro Val Val Thr Asn Thr Thr Thr Thr 
1 5 10 15 

Thr Val Val His Ala Pro Tyr Pro Gin Pro Gin Pro Gin Pro Val Ala 
20 25 30 

Pro Ser Tyr Pro Gly Pro Thr Tyr Gin Gly Tyr His Pro Met Pro Pro 
35 40 45 

Pro Ala Arg Asn Ala Ser Ser Thr Leu Pro Asn Ala Val Pro Thr Thr 
50 55 60 

Leu Pro Gly Pro Ala His Arg Ala Ala Thr Leu Pro 
65 70 75 



<210> 64 
<211> 50 
<212> PRT 
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<213> Mus musculus 
<400> 64 

Cys Pro Asp Phe Cys Cys Gly Ser Cys Ser Asn Gin Tyr Cys Cys Ser 
1 5 10 15 

Asp Val Leu Arg Lys He Gin Trp Asn Glu Glu Met Cys Pro Glu Pro 
20 25 30 

Glu Ser Ser Arg Phe Ser Thr Pro Ala Glu Glu Thr Pro Glu His Leu 
35 40 45 

Gly Ser 
50 



<210> 65 
<211> 56 
<212> PRT 

<213> Mus musculus 
<400> 65 

Cys Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys Met Cys Cys Pro Gin 
1 5 10 15 

Arg Pro Val Val Thr Asn Thr Thr Thr Thr Thr Val Val His Ala Pro 
20 25 30 

Tyr Pro Gin Pro Gin Pro Gin Pro Val Ala Pro Ser Tyr Pro Gly Pro 
. 35 40 45 

Thr Tyr Gin Gly Tyr His Pro Met 
50 55 



<210> 66 
<211> 1927 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> modif ied_base 
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 66 

ccccgcctcc aaagctaacc ctcgggcttg aggggaagan gctgactgta cgttccttct 60 
actctggcac cactctccag gctgccatgg ggcccagcac ccctctcctc atcttgttcc 120 
ttttgtcatg gtcgggaccc ctccaaggac agcagcacca ccttgtggag tacatggaac 180 
gccgactagc tgctttagag gaacggctgg cccagtgcca ggaccagagt agtcggcatg 240 
ctgctgagct gcgggacttc aagaacaaga tgctngccac tgctggaggt ggcagagaag 300 
gagcgggagg cactcagaac tgaggccgac accatctccg ggagagtgga tcgtctggag 360 
cgggaggtag actatctgga gacccagaac ccagctctgc cctgtgtaga gtttgatgag 420 
aaggttgact ggaggccctg ggaccaaagg caagggaaga aggaatgaga agtacgatat 480 
ggtgacagac tgtggctaca caatctctca agtgagatca atgaagattc tgaagcgatt 540 
tggtggccca gctggtctat ggaccaagga tccactgggg caaacagaga agatctacgt 600 
gttagatggg acacagaatg acacagcctt tgtcttccca aggctgcgtg acttcaccct 660 
tgccatggct gcccggaaag cttcccgagt ccgggtgccc ttcccctggg taggcacagg 720 
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gcagctggta tatggtggct ttctttattt tgctcggagg cctcctggaa gacctggtgg 780 
aggtggtgag atggagaaca ctttgcagct aatcaaattc cacctggcaa accgaacagt 84 0 
ggtggacagc tcagtattcc cagcagaggg gctgatcccc ccctacggct tgacagcaga 900 
cacctacatc gacctggcag ctgatgagga aggtctttgg gctgtctatg ccacccggga 960 
ggatgacagg cacttgtgtc tggccaagtt agatccacag acactggaca cagagcagca 1020 
gtgggacaca ccatgtccca gagagaatgc tgaggctgcc tttntcatct gtgggaccct 1080 
ctatgtcgtc tataacaccc gtcctgccag tcgggcccgc atccagtgct cctttgatgc 1140 
cagcggaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200 
catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatcgctac 1260 
cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 
gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 
cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 
aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 
agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 
atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620 
gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 
gtccagctct gtcctctctt cctcactcct cccttcagtg tcctcaggaa caggactttc 1740 
tccacattgt tttgtattgc aacattttgc attaaaaagg aaaatccana aaaaaaaaaa 1800 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1860 
aaactgcggc cgctgtccct tctgtcgtct tctcgcagcc gtacccttct gtcgtcttct 1920 
cgcagcc 1927 

<210> 67 
<211> 319 
<212> PRT 

<213> Homo sapiens 
<400> 67 

Met Val Gly Lys Met Trp Pro Val Leu Trp Thr Leu Cys Ala Val Arg 
1 5 10 15 

Val Thr Val Asp Ala lie Ser Val Glu Thr Pro Gin Asp Val Leu Arg 
20 25 30 

Ala Ser Gin Gly Lys Ser Val Thr Leu Pro Cys Thr Tyr His Thr Ser 
35 40 45 

Thr Ser Ser Arg Glu Gly Leu He Gin Trp Asp Lys Leu Leu Leu Thr 
50 55 60 

His Thr Glu Arg Val Val He Trp Pro Phe Ser Asn Lys Asn Tyr He 
65 70 75 80 

His Gly Glu Leu Tyr Lys Asn Arg Val Ser He Ser Asn Asn Ala Glu 
85 90 95 

Gin Ser Asp Ala Ser He Thr He Asp Gin Leu Thr Met Ala Asp Asn 
100 105 110 

Gly Thr Tyr Glu Cys Ser Val Ser Leu Met Ser Asp Leu Glu Gly Asn 
115 120 125 

Thr Lys Ser Arg Val Arg Leu Leu Val Leu Val Pro Pro Ser Lys Pro 
130 135 140 

Glu Cys Gly lie Glu. Gly Glu Thr He He Gly Asn Asn He Gin Leu 
145 150 155 160 
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Thr Cys Gin Ser Lys Glu Gly Ser Pro Thr Pro Gin Tyr Ser Trp Lys 
165 170 175 

Arg Tyr Asn lie Leu Asn Gin Glu Gin Pro Leu Ala Gin Pro Ala Ser 
180 185 190 

Gly Gin Pro Val Ser Leu Lys Asn He Ser Thr Asp Thr Ser Gly Tyr 
195 200 205 

Tyr He Cys Thr Ser Ser Asn Glu Glu Gly Thr Gin Phe Cys Asn He 
210 215 220 

Thr Val Ala Val Arg Ser Pro Ser Met Asn Val Ala Leu Tyr Val Gly 
225 230 235 240 

He Ala Val Gly Val Val Ala Ala Leu He He He Gly He He He 
245 250 " 255 

Tyr Cys Cys Cys Cys Arg Gly Lys Asp Asp Asn Thr Glu Asp Lys Glu 
260 265 270 

Asp Ala Arg Pro Asn Arg Glu Ala Tyr Glu Glu Pro Pro Glu Gin Leu 
275 280 285 

Arg Glu Leu Ser Arg Glu Arg Glu Glu Glu Asp Asp Tyr Arg Gin Glu 
290 295 300 



Glu Gin Arg Ser Thr Gly Arg Glu Ser Pro Asp His Leu Asp Gin 
305 310 315 



<210> 68 
<211> 2793 
<212> DNA 

<213> Homo sapiens 
<400> 68 

ctaccccttt gtgagcagtc taggactttg 
aggtggctgg tttaagggga acttgaggga 
agtaggtgac acatgagccc agccccagct 
gccaatccag ctgaggctgg gcagaggtgg 
agttgggcca ggccagaagc tgctgtagct 
ctgccagtga caggttaggt ttagggcaga 
ggcctgtgtt gtggacactc tgtgcagtca 
ctccgcagga cgttcttcgg gcttcgcagg 
acacttccac ctccagtcga gagggactta 
cggaaagggt ggtcatctgg ccgttttcaa 
agaatcgcgt cagcatatcc aacaatgctg 
agctgaccat ggctgacaac ggcacctacg 
agggcaacac caagtcacgt gtccgcctgt 
gcggcatcga gggagagacc ataattggga 
agggctcacc aacccctcag tacagctgga 
ccctggccca gccagcctca ggtcagcctg 
cgggttacta catctgtacc tccagcaatg 
tggccgtcag atctccctcc atgaacgtgg 
ttgcagccct cattatcatt ggcatcatca 
acaacactga agacaaggag gatgcaaggc 
agcagctaag agaactttcc agagagaggg 



tacacctgtt aagtagggag aaggcagggg 60 
agtagggaag actcctcttg ggacctttgg 120 
cacctgccaa tccagctgag gagctcacct 180 
gtgagaagag ggaaaattgc agggacctcc 240 
ttaaccagac agctcagacc tgtctggagg 300 
gaagaagcaa gaccatggtg gggaagatgt 360 
gggtgaccgt cgatgccatc tctgtggaaa 420 
gaaagagtgt caccctgccc tgcacctacc 480 
ttcaatggga taagctcctc ctcactcata 540 
acaaaaacta catccatggt gagctttata 600 
agcagtccga tgcctccatc accattgatc 660 
agtgttctgt ctcgctgatg tcagacctgg 720 
tggtcctcgt gccaccctcc aaaccagaat 780 
acaacatcca gctgacctgc caatcaaagg 840 
agaggtacaa catcctgaat caggagcagc 900 
tctccctgaa gaatatctcc acagacacat 960 
aggaggggac gcagttctgc aacatcacgg 102 0 
ccctgtatgt gggcatcgcg gtgggcgtgg 1080 
tctactgctg ctgctgccga gggaaggacg 1140 
cgaaccggga agcctatgag gagccaccag 12 00 
aggaggagga tgactacagg caagaagagc 1260 
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agaggagcac tgggcgtgaa tccccggacc acctcgacca gtgacaggcc agcagcagag 1320 
ggcggcggag gaagggttag gggttcattc tcccgcttcc tggcctccct tctcctttct 1380 
aagccctgtt ctcctgtccc tccatcccag acattgatgg ggacatttct tccccagtgt 144 0 
cagctgtggg gaacatggct ggcctggtaa gggggtccct gtgctgatcc tgctgacctc 1500 
actgtcctgt gaagtaaccc ctcctggctg tgacacctgg tgcgggcctg gccctcactc 1560 
aagaccaggc tgcagcctcc acttccctcg tagttggcag gagctcctgg aagcacagcg 162 0 
ctgagcatgg ggcgctccca ctcagaactc tccagggagg cgatgccagc cttggggggt 1680 
gggggctgtc ctgctcacct gtgtgcccag cacctggagg ggcaccaggt ggagggtttg 174 0 
cactccacac atctttcttg aatgaatgaa agaataagtg agtatgcttg ggccctgcat 1800 
tggcctggcc tccagctccc actccctttc caacctcact tcccgtagct gccagtatgt 1860 
tccaaaccct cctgggaagg ccacctccca ctcctgctgc acaggccctg gggagctttt 1920 
gcccacacac tttccatctc tgcctgtcaa tatcgtacct gtccctccag gcccatctca 1980 
aatcacaagg atttctctaa ccctatccta attgtccaca tacgtggaaa caatcctgtt 2040 
actctgtccc acgtccaatc atgggccaca aggcacagtc ttctgagcga gtgctctcac 2100 
tgtattagag cgccagctcc ttggggcagg gcctgggcct catggctttt gctttccctg 2160 
aagccctagt agctggcgcc catcctagtg ggcacttaag cttaattggg gaaactgctt 2220 
tgattggttg tgccttccct tctctggtct ccttgagatg atcgtagaca cagggatgat 2280 
tcccacccaa acccacgtat tcattcagtg agttaaacac gaattgattt aaagtgaaca 2340 
cacacaaggg agcttgcttg cagatggtct gagttcttgt gtcctggtaa ttcctctcca 2400 
ggccagaata attggcatgt ctcctcaacc cacatggggt tcctggttgt tcctgcatcc 2460 
cgatacctca gccctggccc tgcccagccc atttgggctc tggttttctg gtggggctgt 2520 
cctgctgccc tcccacagcc tccttctgtt tgtcgagcat ttcttctact cttgagagct 2580 
caggcagcgt tagggctgct taggtctcat ggaccagtgg ctggtctcac ccaactgcag 2640 
tttactattg ctatcttttc tggatgatca gaaaaataat tccataaatc tattgtctac 2700 
ttgcgatttt ttaaaaaatg tatattttta tatatattgt taaatccttt gcttcattcc 2760 
aaatgctttc agtaataata aaattgtggg tgg 2793 

<210> 69 

<211> 52 

<212> PRT 

<213> Homo sapiens 

<400> 69 

Lys Thr Ala Leu Gly Glu Leu Leu Lys Pro Leu Asn Ser Glu Tyr Gly 
1 5 10 15 

Lys Val Ala Pro Gly Trp Gly Thr Thr Pro Leu Met Gly Val Phe Met 
20 25 30 

Ala Leu Phe Ala Val Phe Leu Leu He He Leu Glu He Tyr Asn Ser 
35 40 45 

Ser Val Leu Leu 
50 



<210> 70 

<211> 1832 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modif ied_base 
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 70 

tgtggctgac gtcatctgga ggagatttgc tttctttttc tccaaaaggg gaggaaattg 60 
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aaactgcagt ggcccacgat gggaagaggg gaaagcccag gggtacagga ggcctctggg 12 0 
tgaaggcaga ggctaacatg gggttcggag cgaccttggc cgttggctga ccatctttgt 180 
gctgtctgtc gtcactatca tcatctgctt cacctgctcc tgctgctgcc tttacaagac 240 
gtgccgccga ccacgtccgg ttgtcaccac caccacatcc accactgtgg tgcatgcccc 300 
ttatcctcag cctccaagtg tgccgcccag ctaccctgga ccaagctacc agggctacca 360 
caccatgccg cctcagccag ggatgccagc agcaccctac ccaatgcagt acccaccacc 420 
ttacccagcc cagcccatgg gcccaccggc ctaccacgag accctggctg gaggagcagc 480 
cgcgccctam cccgscagcc agcctcctta caacccggcc tacatggatg cccgaagcgg 540 
ccctctgagc attccctggc ctctytggct gccacttggt tatgttgtgt gtgtgcgtra 600 
gtggtgtgca ggcgcggttc cttacgcccc atgtgtgctg tgtgtgtcca ggcacggttc 660 
cttacgcccc atgtgtgctg tgtgtgtcct gcctgtatat gtggcttcct ctgatgctga 720 
caagtgggga acaatccttg ccagagtggg ctgggaccag actttgttct cttcctcacc 780 
tgaaattatg cttcctaaaa tctcaagcca aactcaaaga atggggtggt ggggggcacc 840 
ctgtgaggtg gcccctgaga ggtgggggcc tctccagggc acatctggag ttcttctcca 900 
gcttacccta gggtgaccaa gtagggcctg tcacaccagg gtggcgcast ttctgtgtga 960 
tgcagatgtg tcctggtttc ggcagcgtag ccagctgctg cttgaggcca tggctcgtcc 1020 
ccggagttgg gggtacccgt tgcagagcca gggacatgat gcaggcgaag yttgggatct 1080 
ggccaagttg gactttgatc ctttgggcag atgtcccatt gctccctgga gcctgtcatg 114 0 
cctgttgggg atcaggcagc ctcctgatgc cagaacacct caggcagagc cctactcagc 1200 
tgtacctgtc tgcctggact gtcccctgtc cccgcatctc ccctgggacc agctggaggg 1260 
ccacatgcac acacagccta gctgccccca gggagctctg ctgcccttgc tggccctgcc 1320 
cttcccacag gtgagcaggg ctcctgtcca ccagcacact cagttctctt ccctgcagtg 1380 
ttttcatttt attttagcca aacattttgc ctgttttctg tttcaaacat gatagttgat 1440 
atgagactga aacccctggg ttgtggaggg aaattggctc agagatggac aacctggcaa 1500 
ctgtgagtcc ctgcttcccg acaccagcct catggaatat gcaacaactc ctgtacccca 1560 
gtccacggtg ttctggcagc agggacacct gggccaatgg gccatctgga ccaaaggtgg 1620 
ggtgtggggc cctggatggc agctctggcc cagacatgaa tacctcgtgt tcctcctccc 1680 
tctattactg tttcaccaga gctgtcttag ctcaaatctg ttgtgtttct gagtctaggg 1740 
tctgtacact tgtttataat aaatgcaatc gtttnggaaa aaaaananaa aaaaaaaagg 1800 
ggsggcgctc taaaaggatn ccccnaaggg gg 1832 

<210> 71 
<211> 51 
<212> PRT 

<213> Mus musculus 
<400> 71 

Ser Pro Arg Ser Lys Pro Thr Ala Gin Tyr Gin Trp Glu Arg Leu Ala 
1 5 10 15 

Pro Ser Ser Gin Val Phe Phe Gly Pro Ala Leu Asp Ala Val Arg Gly 
20 25 30 

Ser Leu Lys Leu Thr Asn Leu Ser He Ala Met Ser Gly Val Tyr Val 
35 40 45 

Cys Lys Ala 
50 



<210> 72 

<211> 2557 

<212> DNA 

<213> Homo sapiens 

<400> 72 

gaattccggg agaagtgacc agagcaattt ctgcttttca cagggcgggt ttctcaacgg 60 
tgacttgtgg gcagtgcctt ctgctgagcg agtcatggcc cgaaggcaga actaactgtg 120 
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cctgcagtct tcactctcag gatgcagccg aggtgggccc aaggggccac gatgtggctt 180 
ggagtcctgc tgacccttct gctctgttca agccttgagg gtcaagaaaa ctctttcaca 240 
atcaacagtg ttgacatgaa gagcctgccg gactggacgg tgcaaaatgg gaagaacctg 300 
accctgcagt gcttcgcgga tgtcagcacc acctctcacg tcaagcctca gcaccagatg 360 
ctgttctata aggatgacgt gctgttttac aacatctcct ccatgaagag cacagagagt 420 
tattttattc ctgaagtccg gatctatgac tcagggacat ataaatgtac tgtgattgtg 4 80 
aacaacaaag agaaaaccac tgcagagtac cagctgttgg tggaaggagt gcccagtccc 54 0 
agggtgacac tggacaagaa agaggccatc caaggtggga tcgtgagggt caactgttct 600 
gtcccagagg aaaaggcccc aatacacttc acaattgaaa aacttgaact aaatgaaaaa 660 
atggtcaagc tgaaaagaga gaagaattct cgagaccaga attttgtgat actggaattc 720 
cccgttgagg aacaggaccg cgttttatcc ttccgatgtc aagctaggat catttctggg 780 
atccatatgc agacctcaga atctaccaag agtgaactgg tcaccgtgac ggaatccttc 840 
tctacaccca agttccacat cagccccacc ggaatgatca tggaaggagc tcagctccac 900 
attaagtgca ccattcaagt gactcacctg gcccaggagt ttccagaaat cataattcag 960 
aaggacaagg cgattgtggc ccacaacaga catggcaaca aggctgtgta ctcagtcatg 1020 
gccatggtgg agcacagtgg caactacacg tgcaaagtgg agtccagccg catatccaag 1080 
gtcagcagca tcgtggtcaa cataacagaa ctattttcca agcccgaact ggaatcttcc 1140 
ttcacacatc tggaccaagg tgaaagactg aacctgtcct gctccatccc aggagcacct 1200 
ccagccaact tcaccatcca gaaggaagat acgattgtgt cacagactca agatttcacc 1260 
aagatagcct caaagtcgga cagtgggacg tatatctgca ctgcaggtat tgacaaagtg 1320 
gtcaagaaaa gcaacacagt ccagatagtc gtatgtgaaa tgctctccca gcccaggatt 1380 
tcttatgatg cccagtttga ggtcataaaa ggacagacca tcgaagtccg ttgcgaatcg 1440 
atcagtggaa ctttgcctat ttcttaccaa cttttaaaaa caagtaaagt tttggagaat 1500 
agtaccaaga actcaaatga tcctgcggta ttcaaagaca accccactga agacgtcgaa 1560 
taccagtgtg ttgcagataa ttgccattcc catgccaaaa tgttaagtga ggttctgagg 1620 
gtgaaggtga tagccccggt ggatgaggtc cagatttcta tcctgtcaag taaggtggtg 1680 
gagtctggag aggacattgt gctgcaatgt gctgtgaatg aaggatctgg tcccatcacc 1740 
tataagtttt acagagaaaa agagggcaaa cccttctatc aaatgacctc aaatgccacc 1800 
caggcatttt ggaccaagca gaaggctagc aaggaacagg agggagagta ttactgcaca 1860 
gccttcaaca gagccaacca cgcctccagt gtccccagaa gcaaaatact gacagtcaga 1920 
gtcattcttg ccccatggaa gaaaggactt attgcagtgg ttatcatcgg agtgatcatt 1980 
gctctcttga tcattgcggc caaatgttat tttctgagga aagccaaggc caagcagatg 2040 
ccagtggaaa tgtccaggcc agcagtacca cttctgaact ccaacaacga gaaaatgtca 2100 
gatcccaata tggaagctaa cagtcattac ggtcacaatg acgatgtcag aaaccatgca 2160 
atgaaaccaa taaatgataa taaagagcct ctgaactcag acgtgcagta cacggaagtt 2220 
caagtgtcct cagctgagtc tcacaaagat ctaggaaaga aggacacaga gacagtgtac 2280 
agtgaagtcc ggaaagctgt ccctgatgcc gtggaaagca gatactctag aacggaaggc 2340 
tcccttgatg gaacttagac agcaaggcca gatgcacatc cctggaagga catccatgtt 2400 
ccgagaagaa cagataatcc ctgtatttca agacctctgt gcacttattt atgaacctgc 24 60 
cctgctccca cagaacacag caattcctca ggctaagctg ccggttctta aatccatcct 2520 
gctaagttaa tgttgggtag aaagagatac agagggg 2557 

<210> 73 

<211> 738 

<212> PRT 

<213> Homo sapiens 

<400> 73 

Met Gin Pro Arg Trp Ala Gin Gly Ala Thr Met Trp Leu Gly Val Leu 
1 5 10 ' 15 

Leu Thr Leu Leu Leu Cys Ser Ser Leu Glu Gly Gin Glu Asn Ser Phe 
20 25 30 

Thr lie Asn Ser Val Asp Met Lys Ser Leu Pro Asp Trp Thr Val Gin 
35 40 45 

Asn Gly Lys Asn Leu Thr Leu Gin Cys Phe Ala Asp Val Ser Thr Thr 
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50 



55 



60 



Ser His Val Lys Pro Gin His Gin Met Leu Phe Tyr Lys Asp Asp Val 
65 70 75 80 

Leu Phe Tyr Asn lie Ser Ser Met Lys Ser Thr Glu Ser Tyr Phe lie 
85 90 95 

Pro Glu Val Arg He Tyr Asp Ser Gly Thr Tyr Lys Cys Thr Val He 
100 105 110 

Val Asn Asn Lys Glu Lys Thr Thr Ala Glu Tyr Gin Leu Leu Val Glu 
115 120 125 

Gly Val Pro Ser Pro Arg Val Thr Leu Asp Lys Lys Glu Ala He Gin 
130 135 140 

Gly Gly He Val Arg Val Asn Cys Ser Val Pro Glu Glu Lys Ala Pro 
145 150 155 160 

He His Phe Thr He Glu Lys Leu Glu Leu Asn Glu Lys Met Val Lys 
165 170 175 

Leu Lys Arg Glu Lys Asn Ser Arg Asp Gin Asn Phe Val He Leu Glu 
180 185 190 

Phe Pro Val Glu Glu Gin Asp Arg Val Leu Ser Phe Arg Cys Gin Ala 
195 200 205 

Arg He He Ser Gly He His Met Gin Thr Ser Glu Ser Thr Lys Ser 
210 215 220 

Glu Leu Val Thr Val Thr Glu Ser Phe Ser Thr Pro Lys Phe His He 
225 230 235 240 

Ser Pro Thr Gly Met lie Met Glu Gly Ala Gin Leu His He Lys Cys 
245 250 255 

Thr He Gin Val Thr His Leu Ala Gin Glu Phe Pro Glu He He He 
260 265 270 

Gin Lys Asp Lys Ala He Val Ala His Asn Arg His Gly Asn Lys Ala 
275 280 285 

Val Tyr Ser Val Met Ala Met Val Glu His Ser Gly Asn Tyr Thr Cys 
290 295 300 

Lys Val Glu Ser Ser Arg He Ser Lys Val Ser Ser He Val Val Asn 
305 310 315 320 

He Thr Glu Leu Phe Ser Lys Pro Glu Leu Glu Ser Ser Phe Thr His 
325 330 335 

Leu Asp Gin Gly Glu Arg Leu Asn Leu Ser Cys Ser He Pro Gly Ala 
340 345 350 



Pro Pro Ala Asn Phe Thr He Gin Lys Glu Asp Thr He Val Ser Gin 
355 360 365 
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Thr Gin Asp Phe Thr Lys lie Ala 
370 375 

lie Cys Thr Ala Gly He Asp Lys 
385 ^90 

Gin He Val Val Cys Glu Met Leu 
405 

Ala Gin Phe Glu Val He Lys Gly 
420 



Ser Lys Ser Asp Ser Gly Thr Tyr 
380 

Val Val Lys Lys Ser Asn Thr Val 
395 400 

Ser Gin Pro Arg He Ser Tyr Asp 
410 415 

Gin Thr He Glu Val Arg Cys Glu 
425 430 



Ser He Ser Gly Thr Leu Pro He Ser Tyr Gin Leu Leu Lys Thr Ser 
435 440 445 

Lys Val Leu Glu Asn Ser Thr Lys Asn Ser Asn Asp Pro Ala Val Phe 
450 455 460 

Lys Asp Asn Pro Thr Glu Asp Val Glu Tyr Gin Cys Val Ala Asp Asn 
465 470 475 480 

Cys His Ser His Ala Lys Met Leu Ser Glu Val Leu Arg Val Lys Val 
485 490 495 

lie Ala Pro Val Asp Glu Val Gin He Ser He Leu Ser Ser Lys Val 
500 505 510 

Val Glu Ser Gly Glu Asp He Val Leu Gin Cys Ala Val Asn Glu Gly 
515 520 525 

Ser Gly Pro He Thr Tyr Lys Phe Tyr Arg Glu Lys Glu Gly Lys Pro 
530 535 540 

Phe Tyr' Gin Met Thr Ser Asn Ala Thr Gin Ala Phe Trp Thr Lys Gin 
545 550 555 560 

Lys Ala Ser Lys Glu Gin Glu Gly Glu Tyr Tyr Cys Thr Ala Phe Asn 
565 570 575 

Arg Ala Asn His Ala Ser Ser Val Pro Arg Ser Lys He Leu Thr Val 
580 585 " 590 

Arg Val He Leu Ala Pro Trp Lys Lys Gly Leu He Ala Val Val He 
595 600 605 

He Gly Val He He Ala Leu Leu He He Ala Ala Lys Cys Tyr Phe 
610 615 620 

Leu Arg Lys Ala Lys Ala Lys Gin Met Pro Val Glu Met Ser Arg Pro 
625 630 635 640 

Ala Val Pro Leu Leu Asn Ser Asn Asn Glu Lys Met Ser Asp Pro Asn 
645 650 655 

Met Glu Ala Asn Ser His Tyr Gly His Asn Asp Asp Val Arg Asn His 
660 665 670 

Ala Met Lys Pro He Asn Asp Asn Lys Glu Pro Leu Asn Ser Asp Val 
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675 680 685 

Gin. Tyr Thr Glu Val Gin Val Ser Ser Ala Glu Ser His Lys Asp Leu 
690 695 700 

Gly Lys Lys Asp Thr Glu Thr Val Tyr Ser Glu Val Arg Lys Ala Val 
705 710 715 720 

Pro Asp Ala Val Glu Ser Arg Tyr Ser Arg Thr Glu Gly Ser Leu Asp 
725 730 735 

Gly Thr 



<210> 74 

<211> 601 

<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> modif ied_base 

<222> all "n" positions 

<223> n=a, c, g, or t 

<400> 74 

gnnnnnnagg tntanagncn cctttacncc gccgcggacg cgtgggcgga cgcgtggggt 60 
gctgtggagc aagaagcaac ccgaagctag gagtctgtca gcgagggcag gggctgcctg 120 
gttggggtag gagtgggagc agggccagca ggagggtctg aggaagccat tcaaagcgag 180 
cagctgggag agctggggag ccgggaaggg cctacagact acaagagagg atcctggcgt 240 
ctgggcctcc tgggtcatca ccatgaggcc acttcttgcc ctgctgcttc tgggtctggc 300 
atcaggctct cctcctctgg acgacaacaa gatccccagc ctgtgtcccg ggcagcccgg 360 
cctcccaggc acaccaggcc accacggcag ccaaggcctg cctggccgtg acggccgtga 420 
tggccgcgac ggtgcacccg gagctccggg agagaaaggc gagggcggga gaccgggact 4 80 
acctgggcca cgtngggagc ccgggccgcg tggagaggca ggacctgtgg gggctatcgg 540 
gcctggnggg gaatgctcgg tgccccacga tcagcttcag tgccaagcga tcagaaagcc 600 
c 601 

<210> 75 
<211> 732 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> modif ied_base 
<222> all "n" positions 
<223> n=a, c, g, or t 



<400> 75 

gngngttnnn ttccncctcc gacttaaggc 
cttcttcctt ttgtcatggc cgggacccct 
catggaacgc cgactagctg ccttagagga 
tcggcatgct gctgagcttc gggacttcaa 
agagaaggag cgggaaacac tcagaaccga 
tcttgaacgg gaagtagact acctggagac 
ggatgagaag gtgactggag gccctggaac 
cgatatggtg acagactgta gctacacaat 
gcggtttggt ggctcagctg gcctatggac 



tgccatgggg cccagtgctc ctctgctcct 60 
tcagggacag cagcaccacc ttgtggagta 120 
gcggctggca cagtgccagg atcagagcag 180 
aaacaagatg ctgcctctac tggaggtggc 240 
ggcagacagc atttcaggaa gagtggaccg 300 
acagaaccca gctttgccct gtgtagaact 360 
caaaggcaag ggccggagaa atgagaaata 420 
ctctcaggtg aggtcaatga agatcctgaa 480 
caaggatcca ctggggccag canagaagat 540 
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ctacgtgtta gacggnacgc agaacgacac 
ccctcaccat ggctgccgca aagttccgaa 
aactggtgtn tgtggcttcc tttttatctc 
nggtggnaaa at 



ggccttcgtt ttccganggt gcgtgactta 600 
tcgggtgccc ttncctgggt agnacaagaa 660 
aangcntctg gaggaacttg nanggggggn 720 

732 



<210> 76 

<211> 177 

<212> PRT 

<213> Homo sapiens 



<400> 76 



Gin Leu Gin Leu 


His Leu Pro Ala Asn Arg 


Leu 


Gin 


Ala 


Val 


Glu 


Glu 


1 


5 10 










15 




Gly Glu Ser Gly 


Ala Ser Ala Trp Tyr Thr 


Leu 


His 


Arg 


Glu 


Val 


Ser 


20 


25 








30 






Ser Ser Gin Pro 


Trp Glu Val Pro Phe Val 


Met Trp 


Phe 


Phe 


Lys 


Gin 


35 


40 






45 






Lys Glu Lys Glu 


Asp Gin Val Leu Ser Tyr 


He 


Asn 


Gly 


Val 


Thr 


Thr 


50 


55 




60 








Ser Lys Pro Gly 


Val Ser Leu Val Tyr Ser 


Met 


Pro 


Ser 


Arg 


Asn 


Leu 


65 


70 


75 








80 


Ser Leu Arg Val 


Glu Gly Leu Gin Glu Lys 
85 90 


Asp 


Ser 


Gly 


Pro 


Tyr 
95 


Ser 


Cys Ser Val Asn 


Val Gin Asp Lys Gin Gly 


Lys 


Ser 


Arg 


Gly 


His 


Ser 


100 


105 








110 






lie Lys Thr Leu 


Glu Leu Asn Val Leu Val 


Pro 


Pro 


Ala 


Pro 


Pro 


Ser 


115 


120 






125 








Cys Arg Leu Gin 


Gly Val Pro His Val Gly 


Ala 


Asn 


Val 


Thr 


Leu 


Ser 


130 


135 




140 










Cys Gin Ser Pro 


Arg Ser Lys Pro Ala Val 


Gin 


Tyr 


Gin 


Trp 


Asp 


Arg 


145 


150 


155 










160 


Gin Leu Pro Ser 


Phe Gin Thr Phe Phe Ala 


Pro 


Ala 


Leu 


Asp 


Val 


He 



165 170 175 



Arg 



<210> 77 

<211> 735 

<212> DNA 

<213> Homo sapiens 



<400> 77 

atgcgtctgt ttgtccgtcc gtccgtccgt cccgccatgg ctgcgccggc gccctctccg 60 

tggacccttt cgctgctgct gttgttgcta ctgccgtctc cgggtgccca tggcgagctg 120 

tgcaggccct tcggtgaaga caattcgatc ccagagtcct gtcctgactt ctgttgtggc 180 

tcctgttcca gccaatactg ctgctctgao gtgctgaaga aaatccagtg gaatgaggaa 240 

atgtgccctg agccagagtc cagcagattt tccgcccacc cggagacacc agaacagctg 300 

ggttcagcgc tgaagtatca gtccagtctt gacagtgaca acatgccagg gttcggagcg 360 

accgtggcca tcggcctgac cgtcttcgtg gtgtttatcg ctaccatcat tgtgtgcttt 420 

acctgctcct gctgctgtct atataagatg tgctgccgcc cacgacctgt cgtgtccaac 480 

accacaacta ctaccgtggt tcacaccgct taccctcagc ctcaacctgt ggcccccagc 540 

tatcctggac caacatacca gggctaccat cccatgcccc cccagccagg aatgccagca 600 

gcaccctacc caacgcagta ccctccaccc tacctggccc agcccacagg gccaccagcc 660 

tatcatgaga cgttggctgg agccagccag cctccataca acccggccta catggatccc 720 

ccaaaggtag ttccc 735 

<210> 78 
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<211> 18 
<212> PRT 

<213> Homo sapiens 
<400> 78 

Gly Ser Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr 

1 5 10 15 

Val Cys 



<210> 79 
<211> 22 
<212> PRT 

<213> Homo sapiens 
<400> 79 

Lys Ala His Asn Glu Val Gly Thr Ala Gin Cys Asn Val Thr Leu Glu 

1 5 10 15 

Val Ser Thr Gly Pro Gly 
20 

<210> 80 
<211> 728 
<212> DNA 

<213> Homo sapiens 
<400> 80 

atgaggccac tcctcgtcct gctgctcctg ggcctggcgg ccggctcgcc cccactggac 60 
gacaacaaga tccccagcct ctgcccgggg caccccggcc ttccaggcac gccgggccac 120 
catggcagcc agggcttgcc gggccgcgat ggccgcgacg gccgcgacgg cgcgcccggg 180 
gctccgggag agaaaggcga gggcgggagg cgggactgcc gggacctcga ggggaccccg 240 
ggccgcgagg agaggcggga cccgcggggc ccaccgggcc tgccggggag tgctcggtgc 300 
ctccgcgatc cgccttcagc gccaagcgct ccgagagccg ggtgcctccg ccgtctgacg 360 
cacccttgcc cttcgaccgc gtgctggtga acgagcaggg acattacgac gccgtcaccg 420 
gcaagttcac ctgccaggtg cctggggtct actacttcgc cgtccatgcc accgtctacc 480 
gggccagcct gcagtttgat ctggtgaaga atggcgaatc ccttgcctct ttcttccagt 540 
ttttcggggg gtggcccaag ccagcctcgc tctcgggggg ggccatggtg aggctggagc 600 
ctgaggacca agtgtgggtg caggtgggtg tgggtgacta cattggcatc tatgccagca 660 
tcaagacaga cagcaccttc tccggatttc tggtgtactc cgactggcac agctccccag 720 
tctttgct 728 

<210> 81 
<211> 206 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (13) 

<223> Xaa-unknown amino acid 
<400> 81 

Met lie Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu 

1 5 10 15 

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala Gin Leu Gin 

20 25 30 

Leu His Leu Pro Ala Asn Arg Leu Gin Ala Val Glu Glu Gly Glu Ser 
35 40 45 
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Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gin 

50 55 60 

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gin Lys Glu Lys 
65 70 75 80 

Glu Asp Gin Val Leu Ser Tyr He Asn Gly Val Thr Thr Ser Lys Pro 

85 90 95 

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg 

100 105 no 

Val Glu Gly Leu Gin Glu Lys Asp Ser. Gly Pro Tyr Ser Cys Ser Val 

115 120 125 

Asn Val Gin Asp Lys Gin Gly Lys Ser Arg Gly His Ser He Lys Thr 

130 135 140 

Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Leu 
145 150 155 160 

Gin Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gin Ser 

165 170 175 

Pro Arg Ser Lys Pro Ala Val Gin Tyr Gin Trp Asp Arg Gin Leu Pro 

180 185 ~ 190 

Ser Phe Gin Thr Phe Phe Ala Pro Ala Leu Asp Val He Arg 
195 200 205 

<210> 82 

<211> 217 

<212> PRT 

<213> Homo sapiens 

<400> 82 

Gin Leu Gin Leu His Leu Pro Ala Asn Arg Leu Gin Ala Val Glu Glu 

1 5 10 15 

Gly Glu Ser Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser 

20 25 " 30 

Ser Ser Gin Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gin 

35 40 45 

Lys Glu Lys Glu Asp Gin Val Leu Ser Tyr He Asn Gly Val Thr Thr 

50 55 60 

Ser Lys Pro Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu 
65 70 75 ~ 80 

Ser Leu Arg Val Glu Gly Leu Gin Glu Lys Asp Ser Gly Pro Tyr Ser 

85 90 95 

Cys Ser Val Asn Val Gin Asp Lys Gin Gly Lys Ser Arg Gly His Ser 

100 105 no 

He Lys Thr Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser 

115 120 125 

Cys Arg Leu Gin Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser 

130 135 140 

Cys Gin Ser Pro Arg Ser Lys Pro Ala Val Gin Tyr Gin Trp Asp Arg 
145 150 155 160 

Gin Leu Pro Ser Phe Gin Thr Phe Phe Ala Pro Ala Leu Asp Val He 

165 170 ~ 175 

Arg Gly Ser Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val 

180 185 190 

Tyr Val Cys Lys Ala His Asn Glu Val Gly Thr Ala Gin Cys Asn Val 

195 200 205 

Thr Leu Glu Val Ser Thr Gly Pro Gly 
210 215 

<210> 83 
<211> 220 
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<212> PRT 

<213> Homo sapiens 



<400> 83 












Gin Leu 


Gin 


Leu 


His 


Leu Pro 


Ala 


1 






5 






Gly Glu 


Ser Gly Ala 


Ser Ala 


xx y 






20 








Ser Ser 


Gin Pro Trp 


Glu Val 


c ±. <J 




35 








40 




Lys Glu Asp 


Gin Val 

VJXH VOX 


XJC IX 


■J w . 








55 






Pro Gly Val 


OCX. XJG Li 


Val 
val 


65 








70 




Sex* Leu 


Arg Val Glu 


Glv Leu 


Gin 








85 






Cys Ser 


Val 


Asn 


Val 


Gin Asn 


Lys 






100 








lie Lys 


Thr 


Leu 


Glu 


Leu Asn 


Val 




115 








120 


Cys Arg 


Leu Gin Gly 


Val Pro 


His 


130 








135 




Cvs Gin 


Ser Pro Arg 


Ser Lys 


Pro 


145 








150 




Gin Leu 


Pro 


Ser 


Phe 


Gin Thr 


Phe 








165 






rt. x. y oiy 


Ser 


Leu 


Ser 


UCU XXIX 








180 








Tyr Val 


Cys 


Lys 


Ala 


XI X o noil 


m ii 




195 








200 


Thr Leu 


Glu 


Val 


Ser 


Thr Gly 


Pro 


210 








^ X J 




<210> 84 












<211> 202 










<212> PRT 










<213> Homo sapiens 






<400> 84 












Met Gly 


Pro 


Ser 


Thr 


Pro Leu 


Leu 


1 






5 






Gly Pro 


Leu 


Gin 


Gly 


Gin Gin 


His 






20 








Arg Leu 


Ala 


Ala 


Leu 


Glu Glu 

VJ J. LI \JXU 


~y 




35 








40 


Ser Arg 


His 


Ala 


Ala 


m u TtAu 

VJXIX ucu 


«xg 


50 








55 




Leu Leu 


Glu 


Val 


Ala 


Ol ii Ta/q 
vjxu uy o 


fll ii 


65 








70 




Asp Thr 


lie 


Ser 


Gly 


ni. y v dx 


nap 








85 






Leu Glu 


Thr 


Gin 


Asn 


Pro Ala 


Leu 






100 








Val Thr 


Gly 


Gly 


Pro 


Gly Thr 


Lys 




115 








120 


Tyr Asp 


Met 


val 


Thr 


Asp Cys 


Gly 


130 








135 




Met Lys 


He 


Leu 


Lys 


Arg Phe 


Gly 



Asn 


Arcr 


Leu 


Gin 


Ala 


Val 


Glu 


Glu 




10 










15 




Tvr 


Thr 


Leu 


His 


Ara 


Glu 


Val 


Ser 


25 










30 






Phe 


Val 


Met 


TrD 

XT 


Phe 


Phe 


Lys 


Gin 










45 








Ser 


xyx 


Tie 

X X c 


Asn 


<axy 


Val 

v a x 


x tlx 


X XXX 








60 










Tvr 

xy x 


Ser 


Met 


Pro 


Ser 


Ara 


Asn 


uc vx 






75 










80 


Glu 


Lys 


Asp 


Ser 


Gly 


Pro 


Tvr 


Ser 




90 










95 




Gin 


Gly 


Lys 


Ser 


Ara 


Gly 


His 


Ser 


105 










110 






Leu 


Val 


Pro 


Pro 


Ala 


Pro 


Pro 


Ser 










125 








Val 


Gly 


Ala 


Asn 


Val 


Thr 


Leu 


Ser 








140 










Ala 


Val 


Gin Tyr 


Gin 


TrD 


Asp 








155 










160 


Phe 


Ala 


Pro 


Ala 


Leu 


Asp 


Val 


He 




170 










175 




Leu 


Ser 


Ser 


Ser 


Met 


Ala 


Gly 


Val 


185 










190 






Val 


Gly 


Thr 


Ala 


Gin 


Cys 


Asn 


Val 










205 








Gly 


Ala 


Ala 


Val 
















220 










lie 


Leu 


Phe 


Leu 


Leu 


Ser 


Tro 


Ser 




10 










15 




His 


Leu 


Val 


Glu 


Tvr 


Met 


Glu 


Ara 


25 










30 






Leu 


Ala 


Gin Cys 


Gin 


Asd 

XT 


Gin 


Ser 










"X — > 








Asd 


Phe 


Lys Asn 


Lys 


Met 


Leu 


Pro 








60 












Gl ii 

wX IX 


Ala 


Leu 




X XIX 


m ii 

V7X IX 


Al A 
M.X ct 






75 












Aro 


Leu 


Glu Arg 


Glu 


Val 


Asp 


Tvr 

X 




90 










95 




Pro 


Cys 


Val 


Glu 


Phe 


Asp 


Glu 


Lys 


105 










110 






Gly 


Lys 


Gly Arg 


Arg 


Asn 


Glu 


Lys 










125 








Tyr 


Thr 


He 


Ser 


Gin 


Val 


Arg 


Ser 








140 










Gly 


Pro 


Ala Gly 


Leu 


Trp 


Thr 


Lys 
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145 150 155 160 

Asp Pro Leu Gly Gin Thr Glu Lys He Tyr Val Leu Asp Gly Thr Gin 

165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 

180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg Val Arg 
195 200 

<210> 85 
<211> 67 
<212> PRT 

<213> Homo sapiens 
<400> 85 

Ala Ser Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser Gly Thr Leu 

1 5 10 15 

Thr Pro Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg Tyr Gly Ala 

20 25 30 

His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu Tyr Ala Trp 

35 40 45 

Asp Asp Gly Tyr Gin He Val Tyr Lys Leu Glu Met Arg Lys Lys Glu 

50 55 60 

Glu Glu Val 
65 

<210> 86 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 86 

Val Pro Phe Pro Trp Val Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe 

1 5 10 15 

Leu Tyr Phe 



<210> 87 

<211> 17 

<212> PRT 

<213> Homo sapiens 

<400> 87 

Ala Glu Ala Ala Phe Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn 

1 5 10 15 

Thr 



<210> 88 

<211> 99 

<212> PRT 

<213> Homo sapiens 

<400> 88 

Ala Arg Arg Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn 

1 5. 10 15 

Thr Leu Gin Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp 

20 25 30 

Ser Ser Val Phe Pro Ala Glu Gly Leu He Pro Pro Tyr Gly Leu Thr 
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35 40 
Ala Asp Thr Tyr He Asp Leu Ala 

50 55 
Val Tyr Ala Thr Arg Glu Asp Asp 
65 70 
Asp Pro Gin Thr Leu Asp Thr Glu 
85 

Arg Glu Asn 



45 

Ala Asp Glu Glu Gly Leu Trp Ala 
60 

Arg His Leu Cys Leu Ala Lys Leu 

75 - ' 80 

Gin Gin Trp Asp Thr Pro Cys Pro 

90 95 



<210> 89 
<211> 320 
<212> PRT 

<213> Homo sapiens 
<400> 89 

Met Gly Pro Ser Thr Pro Leu Leu He Leu Phe Leu Leu Ser Trp Ser 

1 5 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 

20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 

35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 

50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Thr He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 

85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys 

100 105 110 

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys 

115 120 125 

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr He Ser Gin Val Arg Ser 

130 135 140 

Met Lys He Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Gin Thr Glu Lys He Tyr Val Leu Asp Gly Thr Gin 

165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 

180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val 

195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg 

210 215 220 

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gin 
225 230 235 240 

Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 

245 250 " 255 

Phe Pro Ala Glu Gly Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 

260 265 270 

Tyr He Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 

275 280 " 285 

Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 

290 295 300 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 310 315 320 

<210> 90 
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<211> 385 
<212> PRT 
<213> Homo sapiens 

<400> 90 

Gin Gin His His Leu Val Glu Tyr Met Glu Arg Arg Leu Ala Ala Leu 

1 5 10 15 

Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser Ser Arg His Ala Ala 

20 25 30 

Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro Leu Leu Glu Val Ala 

35 40 45 

Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala Asp Thr He Ser Gly 

50 55 60 

Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr Leu Glu Thr Gin Asn 
65 70 75 80 

Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys Val Thr Gly Gly Pro 

85 90 95 

Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys Tyr Asp Met Val Thr 

100 105 110 

Asp Cys Gly Tyr Thr He Ser Gin Val Arg Ser Met Lys He Leu Lys 

115 120 125 

Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys Asp Pro Leu Gly Gin 

130 135 140 

Thr Glu Lys He Tyr Val Leu Asp Gly Thr Gin Asn Asp Thr Ala Phe 
145 150 155 160 

Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala Met Ala Ala Arg Lys 

165 170 175 

Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val Gly Thr Gly Gin Leu 

180 185 190 

Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg Pro Pro Gly Arg Pro 

195 200 205 

Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gin Leu He Lys Phe His 

210 215 220 

Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val Phe Pro Ala Glu Gly 
225 230 235 240 

Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr Tyr He Asp Leu Ala 

245 250 255 

Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala Thr Arg Glu Asp Asp 

260 265 270 

Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin Thr Leu Asp Thr Glu 

275 280 285 

Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn Ala Glu Ala Ala Phe 

290, 295 300 

Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn Thr Arg Pro Ala Ser 
305 310 315 320 

Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser Gly Thr Leu Thr Pro 

325 330 335 

Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg Tyr Gly Ala His Ala 

340 345 350 

Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu Tyr Ala Trp Asp Asp 

355 360 365 

Gly Tyr Gin He Val Tyr Lys Leu Glu Met Arg Lys Lys Glu Glu Glu 
370 375 380 

Val 
385 

<210> 91 
<211> 728 
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<212> DNA 

<213> Homo sapiens 



<400> 91 

atgaggccac tcctcgtcct gctgctcctg ggcctggcgg ccggctcgcc cccactggac 60 

gacaacaaga tccccagcct ctgcccgggg caccccggcc ttccaggcac gccgggccac 120 

catggcagcc agggcttgcc gggccgcgat ggccgcgacg gccgcgacgg tgcgcccggg 180 

gctccgggag agaaaggcga gggcgggagg cgggactgcc gggacctcga ggggaccccg 240 

ggccgcgagg agaggcggga cccgcggggc ccaccgggcc tgccggggag tgctcggtgc 300 

ctccgcgatc cgccttcagc gccaagcgct ccgagagccg ggtgcctccg ccgtctgacg 360 

cacccttgcc cttcgaccgc gtgctggtga acgagcaggg acattacgac gccgtcaccg 420 

gcaagttcac ctgccaggtg cctggggtct actacttcgc cgtccatgcc accgtctacc 480 

gggccagcct gcagtttgat ctggtgaaga atggcgaatc cattgcctct ttcttccagt 540 

ttttcggggg gtggcccaag ccagcctcgc tctcgggggg ggccatggtg aggctggagc 600 

ctgaggacca agtgtgggtg caggtgggtg tgggtgacta cattggcatc tatgccagca 660 

tcaagacaga cagcaccttc tccggatttc tggtgtactc cgactggcac agctccccag 720 

tctttgct 728 

<210> 92 
<211> 69 
<212> PRT 

<213> Homo sapiens 



<400> 92 



Arg Pro Ala 


Ser Arg 


Ala 


Arg 


He 


1 


5 








Thr Leu Thr 


Pro Glu 


Arg 


Ala 


Ala 




20 








Gly Ala His 


Ala Ser 


Leu 


Arg 


Tyr 


35 








40 


Ala Trp Asp 


Asp Gly 


Tyr 


Gin 


He 


50 






55 




Lys Glu Glu 


Glu Val 









Gin Cys Ser Phe 


Asp 


Ala 


Ser 


Gly 


10 






15 




Leu Pro Tyr Phe 


Pro 


Arg 


Arg 


Tyr 


25 




30 






Asn Pro Arg Glu 


Arg 


Gin 


Leu 


Tyr 




45 








Val Tyr Lys Leu 


Glu 


Met 


Arg 


Lys 



60 



<210> 93 

<211> 202 

<212> PRT 

<213> Mus musculus 



<400> 93 



Met 


Gly 


Pro 


Ser 


Ala 


Pro 


Leu 


Leu Leu Leu Phe Phe 


Leu 


Ser 


Trp 


Thr 


1 








5 






10 






15 




Gly 


Pro 


Leu 


Gin 


Gly 


Gin 


Gin 


His His Leu Val Glu 


Tyr 


Met 


Glu 


Arg 








20 








25 




30 






Arg 


Leu 


Ala 


Ala 


Leu 


Glu 


Glu 


Arg Leu Ala Gin Cys 


Gin 


Asp 


Gin 


Ser 






35 










40 


45 








Ser 


Arg 


His 


Ala 


Ala 


Glu 


Leu 


Arg Asp Phe Lys Asn 


Lys 


Met 


Leu 


Pro 




50 










55 


60 










Leu 


Leu 


Glu 


Val 


Ala 


Glu 


Lys 


Glu Arg Glu Thr Leu 


Arg 


Thr 


Glu 


Ala 


65 










70 




75 








80 


Asp 


Ser 


He 


Ser 


Gly 


Arg 


Val 


Asp Arg Leu Glu Arg 


Glu 


Val 


Asp 


Tyr 










85 






90 






95 




Leu 


Glu 


Thr 


Gin 


Asn 


Pro 


Ala 


Leu Pro Cys Val Glu 


Leu 


Asp 


Glu 


Lys 








100 








105 




110 






Val 


Thr 


Gly 


Gly 


Pro 


Gly 


Ala 


Lys Gly Lys Gly Arg 


Arg 


Asn 


Glu 


Lys 






115 










120 


125 








Tyr 


Asp 


Met 


Val 


Thr 


Asp 


Cys 


Ser Tyr Thr Val Ala 


Gin 


Val 


Arg 


Ser 
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130 










135 


Met 


Lys 


lie 


Leu 


Lys 


Arg 


Phe Gly 


145 










150 




Asp 


Pro 


Leu 


Gly 


Pro Ala Glu Lys 










165 






Asn 


Asp 


Thr 


Ala 
180 


Phe 


Val 


Phe Pro 


Met 


Ala 


Ala 
195 


Arg 


Lys 


Ala 


Ser Arg 
200 



140 

Gly Ser Val Gly Leu Trp Thr Lys 
155 160 
lie Tyr Val Leu Asp Gly Thr Gin 

170 175 
Arg Leu Arg Asp Phe Thr Leu Ala 
185 190 
lie Arg 



<210> 94 

<211> 69 

<212> PRT 

<213> Mus musculus 



<400> 94 






Arg Pro 


Ala 


Ser Arg Ala Arg lie 


1 




5 


Thr Leu 


Ala 


Pro Glu Arg Ala Ala 






20 


Gly Ala 


His 


Ala Ser Leu Arg Tyr 




35 


40 


Ala Trp 


Asp 


Asp Gly Tyr Gin lie 


50 




55 


Lys Glu 


Glu 


Glu Val 


65 







<210> 95 
<211> 19 
<212> PRT 

<213> Mus musculus 



Gin Cys Ser Phe Asp Ala Ser Gly 

10 15 
Leu Ser Tyr Phe Pro Arg Arg Tyr 
25 30 
Asn Pro Arg Glu Arg Gin Leu Tyr 
45 

Val Tyr Lys Leu Glu Met Lys Lys 
60 



<400> 95 

Val Pro Phe Pro Trp Val Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe 

1 5 10 15 

Leu Tyr Tyr 



<210> 96 
<211> 16 
<212> PRT 

<213> Mus musculus 
<400> 96 

Glu Ala Ala Phe Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn Thr 
1 5 10 15 

<210> 97 
<211> 99 
<212> PRT 

<213> Mus musculus 
<400> 97 

Ala Arg Arg Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn 

15 10 15 

Thr Leu Gin Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp 

20 25 30 

Ser Ser Val Phe Pro Ala Glu Ser Leu He Pro Pro Tyr Gly Leu Thr 
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35 40 
Ala Asp Thr Tyr lie Asp Leu Ala 

50 55 
Val Tyr Ala Thr Arg Asp Asp Asp 
65 70 
Asp Pro Gin Thr Leu Asp Thr Glu 
85 

Arg Glu Asn 



45 

Ala Asp Glu Glu Gly Leu Trp Ala 
60 

Arg His Leu Cys Leu Ala Lys Leu 

75 80 
Gin Gin Trp Asp Thr Pro Cys Pro 
90 95 



<210> 98 
<211> 320 
<212> PRT 

<213> Mus musculus 
<400> 98 

Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr 

15 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 

20 25 30 

Arg Leu Ala Ala Leii Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 

35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 

50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Ser He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 

85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys 

100 105 110 

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys 

115 120 125 

Tyr Asp Met Val Thr Asp Cys Ser Tyr Thr Val Ala Gin Val Arg Ser 

130 135 140 

Met Lys He Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys 
145 150 155 ~ 160 

Asp Pro Leu Gly Pro Ala Glu Lys He Tyr Val Leu Asp Gly Thr Gin 

165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 

180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg He Arg Val Pro Phe Pro Trp Val 

195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg 

210 215 220 

Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gin 
225 230 235 240 

Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 

245 250 255 

Phe Pro Ala Glu Ser Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 

260 265 270 

Tyr He Asp Leu Ala Ala Asp Glu Glu Giy Leu Trp Ala Val Tyr Ala 

275 280 285 

Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 

290 295 300 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 310 315 320 

<210> 99 
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<211> 299 
<212> PRT 
<213> Mus musculus 



<400> 99 
























Gin Gin 


His His 


Leu Val Glu 


Tyr 


Met 


Glu 


Arg 


Arg 


Leu 


Ala 


Ala 


Leu 


1 




5 






10 










15 




Glu Glu 


Arg Leu 


Ala Gin Cys 


Gin 


Asp 


Gin 


Ser 


Ser 


Arg 


His 


Ala 


Ala 




20 






25 










30 






Glu Leu 


Arg Asp 


Phe Lys Asn 


Lys 


Met 


Leu 


Pro 


Leu 


Leu 


Glu 


Val 


Ala 




35 




40 










45 








Glu Lys 


Glu Arg 


Glu Thr Leu 


Arg 


Thr 


Glu 


Ala 


Asp 


Ser 


He 


Ser Gly 


50 




55 










60 










Arg Val 


Asp Arg 


Leu Glu Arg 


Glu 


Val 


Asp 


Tyr 


Leu 


Glu 


Thr 


Gin 


Asn 


65 




70 








75 










80 


Pro Ala 


Leu Pro 


Cys Val Glu 


Leu 


Asp 


Glu 


Lys 


Val 


Thr 


Gly 


Gly 


Pro 






85 






90 










95 




Gly Ala 


Lys Gly 


Lys Gly Arg 


Arg 


Asn 


Glu 


Lys 


Tyr 


Asp 


Met 


Val 


Thr 




100 






105 










110 






Asp Cys 


Ser Tyr 


Thr Val Ala 


Gin 


Val 


Arg 


Ser 


Met 


Lys 


He 


Leu 


Lys 




115 




120 










125 








Arg Phe 


Gly Gly 


Ser Val Gly 


Leu 


Tip 


Thr 


Lys 


Asp 


Pro 


Leu 


Gly Pro 


130 




135 










140 










Ala Glu 


Lys lie 


Tyr Val Leu 


Asp 


Gly 


Thr 


Gin 


Asn 


Asp 


Thr 


Ala 


Phe 


145 




150 








155 










160 


Val Phe 


Pro Arg 


Leu Arg Asp 


Phe 


Thr 


Leu 


Ala 


Met 


Ala 


Ala 


Arg 


Lys 






165 






170 










175 




Ala Ser 


Arg lie 


Arg Val Pro 


Phe 


Pro 


Trp 


Val 


Gly 


Thr 


Gly 


Gin 


Leu 




180 






185 










190 






Val Tyr 


Gly Gly 


Phe Leu Tyr 


Tyr 


Ala 


Arg 


Arg 


Pro 


Pro 


Gly 


Gly Pro 




195 




200 










205 








Gly Gly 


Gly Gly 


Glu Leu Glu 


Asn 


Thr 


Leu 


Gin 


Leu 


He 


Lys 


Phe 


His 


210 




215 










220 










Leu Ala 


Asn Arg 


Thr Val Val 


Asp 


Ser 


Ser 


Val 


Phe 


Pro 


Ala 


Glu 


Ser 


225 




230 








235 










240 


Leu lie 


Pro Pro 


Tyr Gly Leu 


Thr 


Ala 


Asp 


Thr 


Tyr 


He 


Asp 


Leu 


Ala 






245 






250 










255 




Ala Asp 


Glu Glu 


Gly Leu Trp 


Ala 


Val 


Tyr 


Ala 


Thr 


Arg 


Asp 


Asp Asp 




260 






265 










270 






Arg His 


Leu Cys 


Leu Ala Lys 


Leu 


Asp 


Pro 


Gin 


Thr 


Leu 


Asp 


Thr 


Glu 




275 




280 










285 








Gin Gin 


Trp Asp 


Thr Pro Cys 


Pro 


Arg 


Glu 


Asn 












290 




295 





















<210> 100 
<211> 728 
<212> DNA 
<213> Homo sapiens 

<400> 100 

atgaggccac tcctcgtcct gctgctcctg ggcctggcgg ccggctcgcc cccactggac 60 

gacaacaaga tccccagcct ctgcccgggg caccccggcc ttccaggcac gccgggccac 120 

catggcagcc agggcttgcc gggccgcgat ggccgcgacg gccgcgacgg cgcgcccggg 180 

gctccgggag agaaaggcga gggcgggagg cgggactgcc gggacctcga ggggaccccg 240 

ggccgcgagg agaggcggga cccgcggggc ccaccgggcc tgtcggggag tgctcggtgc 300 

ctccgcgatc cgccttcagc gccaagcgct ccgagagccg ggtgcctccg ccgtctgacg 360 

cacccttgcc cttcgaccgc gtgctggtga acgagcaggg acattacgac gccgtcaccg 420 

gcaagttcac ctgccaggtg cctggggtct actacttcgc cgtccatgcc accgtctacc 480 
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gggccagcct gcagtttgat ctggtgaaga atggcgaatc cattgcctct ttcttccagt 540 

ttttcggggg gtggcccaag ccagcctcgc tctcgggggg ggccatggtg aggctggagc 600 

ctgaggacca agtgtgggtg caggtgggtg tgggtgacta cattggcatc tatgccagca 660 

tcaagacaga cagcaccttc tccggatttc tggtgtactc cgactggcac agctccccag 720 

tctttgct 728 

<210> 101 
<211> 728 
<212> DNA 

<213> Homo sapiens 
<400> 101 

atgaggccac tcctcgtcct gctgctcctg ggcctggcgg ccggctcgcc cccactggac 60 

gacaacaaga tccccagcct ctgcccgggg caccccggcc ttccaggcac gccgggccac 120 

catggcagcc agggcttgcc gggccgcgat ggccgcgacg gccgcgacgg cgcgcccggg 180 

gctccgggag agaaaggcga gggcgggagg cgggactgcc gggacctcga ggggaccccg 240 

ggccgcgagg agaggcggga cccgcggggc ccaccgggcc tgccggggag tgctcggtgc 300 

ctccgcgatc cgccttcagc gccaagcgct ccgagagccg ggtgcctccg ccgtctgacg 3 60 

cacccttgcc cttcgaccgc gtgctggtga acgagcaggg acattacgac gccgtcaccg 420 

gcaagttcac ctgccaggtg cctggggtct actacttcgc cgtccatgcc accgtctacc 4 80 

gggccagcct gcagtttgat ctggtgaaga atggcgaatc cattgcctct ttcttccagt 540 

ttttcggggg gtggcccaag ccagcctcgc tctcgggggg ggccatggtg aggctggagc 600 

ctgaggacca agtgtgggtg caggtgggtg tgggtgacta cattggcatc tatgccagca 660 

tcaagacaga cagcaccttc tccggatttc tggtgtactc cgactggcac agctccccag 720 

tctttgct 728 

<210> 102 
<211> 243 
<212> PRT 

<213> Homo sapiens 
<400> 102 

Met Arg Pro Leu Leu Val Leu Leu Leu Leu Gly Leu Ala Ala Gly Ser 

1 5 10 15 

Pro Pro Leu Asp Asp Asn Lys lie Pro Ser Leu Cys Pro Gly His Pro 

20 25 30 

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gin Gly Leu Pro Gly 

35 40 45 

Arg Asp Gly Arg Asp Gly Arg Asp Gly Val Pro Gly Ala Pro Gly Glu 

50 55 60 

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp Pro 
65 70 75 80 

Gly Pro Arg Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Ala Gly 

85 90 95 

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu 

100 105 110 

Ser Arg Val Pro Pro Pro Ser Asp Ala Pro Leu Pro Phe Asp Arg Val 

115 120 125 

Leu Val Asn Glu Gin Gly His Tyr Asp Ala Val Thr Gly Lys Phe Thr 

130 135 140 

Cys Gin Val Pro Gly Val tyr Tyr Phe Ala Val His Ala Thr Val Tyr 
145 150 155 160 

Arg Ala Ser Leu Gin Phe Asp Leu Val Lys Asn Gly Glu Ser lie Ala 

165 170 175 

Ser Phe Phe Gin Phe Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser 

180 185 190 

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gin Val Trp Val Gin 
195 200 205 
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Val Gly Val Gly Asp Tyr He Gly He Tyr Ala Ser lie Lys Thr Asp 

210 215 220 

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro 
225 230 235 240 

Val Phe Ala 



<210> 103 

<211> 1338 

<212> DNA 

<213> Homo sapiens 

<400> 103 

gtcgacccac gcgtccggga ctggggtgac ggcagggcag ggggcgcctg gccggggaga 60 

agcgcggggg ctggagcacc accaactgga gggtccggag tagcgagcgc cccgaaggag 120 

gccatcgggg agccgggagg ggggactgcg agaggacccc ggcgtccggg ctcccggtgc 180 

cagcgctatg aggccactcc tcgtcctgct gctcctgggc ctggcggccg gctcgccccc 24 0 

actggacgac aacaagatcc ccagcctctg cccggggcac cccggccttc caggcacgcc 300 

gggccaccat ggcagccagg gcttgccggg ccgcgatggc cgcgacggcc gcgacggtgc 360 

gcccggggct ccgggagaga aaggcgaggg cgggaggcgg gactgccggg acctcgaggg 420 

gaccccgggc cgcgaggaga ggcgggaccc gcggggccca ccgggcctgc cggggagtgc 4 80 

tcggtgcctc cgcgatccgc cttcagcgcc aagcgctccg agagccgggt gcctccgccg 540 

tctgacgcac ccttgccctt cgaccgcgtg ctggtgaacg agcagggaca ttacgacgcc 600 

gtcaccggca agttcacctg ccaggtgcct ggggtctact acttcgccgt ccatgccacc 660 

gtctaccggg ccagcctgca gtttgatctg gtgaagaatg gcgaatccat tgcctctttc 720 

ttccagtttt tcggggggtg gcccaagcca gcctcgctct cggggggggc catggtgagg 780 

ctggagcctg aggaccaagt gtgggtgcag gtgggtgtgg gtgactacat tggcatctat 840 

gccagcatca agacagacag caccttctcc ggatttctgg tgtactccga ctggcacagc 900 

tccccagtct ttgcttagtg cccactgcaa agtgagctca tgctctcact cctagaagga 960 

999tgtgagg ctgacaacct ggtcatccag gagggctggc ccccctggaa tattgtgaat 1020 

gactagggag gtggggtaga gcactctccg tcctgctgct ggcaaggaat gggaacagtg 1080 

gctgtctgcg atcaggtctg gcagcatggg gcagtggctg gatttctgcc caagaccaga 1140 

ggagtgtgct gtgctggcaa gtgtaagtcc cccagttgct ctggtccagg agcccacggt 1200 

ggggtgctct cttcctggtc ctctgcttct ctggatcctc cccaccccct cctgctcctg 1260 

gggccggccc ttttctcaga gatcactcaa taaacctaag aaccctccaa aaaaaaaaaa 1320 

aaaaaaaagg gcggccgc 1338 

<210> 104 
<211> 243 
<212> PRT 
<213> Homo sapiens 



<400> 104 



Met 


Arg 


Pro 


Leu 


Leu 


Val 


Leu 


Leu 


Leu 


Leu 


Gly Leu 


Ala 


Ala 


Gly 


Ser 


1 








5 










10 








15 




Pro 


Pro 


Leu 


Asp 


Asp 


Asn 


Lys 


He 


Pro 


Ser 


Leu Cys 


Pro 


Gly 


His 


Pro 








20 










25 








30 






Gly 


Leu 


Pro 


Gly 


Thr 


Pro 


Gly 


His 


His 


Gly 


Ser Gin 


Gly 


Leu 


Pro 


Gly 






35 










40 








45 








Arg 


Asp 


Gly 


Arg 


Asp 


Gly 


Arg 


Asp 


Gly 


Ala 


Pro Gly 


Ala 


Pro 


Gly 


Glu 




50 










55 








60 










Lys 


Gly 


Glu 


Gly 


Gly 


Arg 


Pro 


Gly 


Leu 


Pro 


Gly Pro 


Arg 


Gly 


Asp 


Pro 


65 










70 










75 








80 


Gly 


Pro 


Arg 


Gly 


Glu 


Ala 


Gly 


Pro 


Ala 


Gly 


Pro Thr 


Gly 


Pro 


Val 


Gly 










85 










90 








95 




Glu 


Cys 


Ser 


Val 


Pro 


Pro 


Arg 


Ser 


Ala 


Phe 


Ser Ala 


Lys 


Arg 


Ser 


Glu 








100 










105 








110 






Ser 


Arg 


Val 


Pro 


Pro 


Pro 


Ser 


Asp 


Ala 


Pro 


Leu Pro 


Phe 


Asp 


Arg 


Val 
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115 










120 




125 






Leu 


Val 


Asn 


Glu 


Gin 


Gly 


His 


Tyr Asp 


Ala 


Val Thr Gly Lys 


Phe 


Thr 




130 










135 






140 






Cys 


Gin 


Val 


Pro 


Gly 


Val 


Tyr 


Tyr Phe 


Ala 


Val His Ala Thr 


Val 


Tvr 


145 










150 








155 




160 


Arg 


Ala 


Ser 


Leu 


Gin 


Phe 


Asp 


Leu Val 


Lys 


Asn Gly Glu Ser 


He 


Ala 










165 








170 




X / -J 




Ser 


Phe 


Phe 


Gin 


Phe 


Phe 


Gly 


Gly Trp 


Pro 


Lys Pro Ala Ser 


Leu 


Ser 








180 








185 




190" 

J- ^ V 






Gly 


Gly 


Ala 


Met 


Val 


Arg 


Leu 


Glu Pro 


Glu 


Asp Gin Val Trp 


Val 


Gin 






195 










200 




205 






Val 


Gly 


Val 


Gly 


Asp 


Tyr 


He 


Gly He 


Tyr 


Ala Ser He Lys 


Thr 


Asp 




210 










215 






220 




Ser 


Thr 


Phe 


Ser 


Gly 


Phe 


Leu 


Val Tyr 


Ser 


Asp Trp His Ser 


Ser 


Pro 


225 










230 








235 




240 


Val 


Phe 


Ala 





















<210> 105 

<211> 1338 

<212> DNA 

<213> Homo sapiens 

<400> 105 

gtcgacccac gcgtccggga ctggggtgac ggcagggcag ggggcgcctg gccggggaga 60 

agcgcggggg ctggagcacc accaactgga gggtccggag tagcgagcgc cccgaaggag 120 

gccatcgggg agccgggagg ggggactgcg agaggacccc ggcgtccggg ctcccggtgc 180 

cagcgctatg aggccactcc tcgtcctgct gctcctgggc ctggcggccg gctcgccccc 240 

actggacgac aacaagatcc ccagcctctg cccggggcac cccggccttc caggcacgcc 300 

gggccaccat ggcagccagg gcttgccggg ccgcgatggc cgcgacggcc gcgacggcgc 360 

gcccggggct ccgggagaga aaggcgaggg cgggaggcgg gactgccggg acctcgaggg 420 

gaccccgggc cgcgaggaga ggcgggaccc gcggggccca ccgggcctgt cggggagtgc 480 

tcggtgcctc cgcgatccgc cttcagcgcc aagcgctccg agagccgggt gcctccgccg 540 

tctgacgcac ccttgccctt cgaccgcgtg ctggtgaacg agcagggaca ttacgacgcc 600 

gtcaccggca agttcacctg ccaggtgcct ggggtctact acttcgccgt ccatgccacc 660 

gtctaccggg ccagcctgca gtttgatctg gtgaagaatg gcgaatccat tgcctctttc 720 

ttccagtttt tcggggggtg gcccaagcca gcctcgctct cggggggggc catggtgagg 780 

ctggagcctg aggaccaagt gtgggtgcag gtgggtgtgg gtgactacat tggcatctat 84 0 

gccagcatca agacagacag caccttctcc ggatttctgg tgtactccga ctggcacagc 900 

tccccagtct ttgcttagtg cccactgcaa agtgagctca tgctctcact cctagaagga 960 

gggtgtgagg ctgacaacct ggtcatccag gagggctggc ccccctggaa tattgtgaat 1020 

gactagggag gtggggtaga gcactctccg tcctgctgct ggcaaggaat gggaacagtg 1080 

gctgtctgcg atcaggtctg gcagcatggg gcagtggctg gatttctgcc caagaccaga 1140 

ggagtgtgct gtgctggcaa gtgtaagtcc cccagttgct ctggtccagg agcccacggt 1200 

ggggtgctct cttcctggtc ctctgcttct ctggatcctc cccaccccct cctgctcctg 1260 

gggccggccc ttttctcaga gatcactcaa taaacctaag aaccctccaa aaaaaaaaaa 1320 

aaaaaaaagg gcggccgc 1338 

<210> 106 
<211> 243 
<212> PRT 
<213> Homo sapiens 

<400> 106 

Met Arg Pro Leu Leu Val Leu Leu Leu Leu Gly Leu Ala Ala Gly Ser 

1 5 10 15 

Pro Pro Leu Asp Asp Asn Lys He Pro Ser Leu Cys Pro Gly His Pro 
20 25 30 
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Gly Leu Pro 


Gly 


Thr 


Pro 


Gly 


His His 


Gly Ser Gin Gly Leu Pro Gly 


35 










40 


45 


Arg Asp Gly 


Arg 


Asp 


Gly 


Arg 


Asp Gly 


Ala Pro Gly Ala Pro Gly Glu 


50 








55 




60 


Lys Gly Glu 


Gly 


Gly 


Arg 


Pro 


Gly Leu 


Pro Gly Pro Arg Gly Asp Pro 


65 






70 






75 80 


Gly Pro Arg 


Gly 


Glu 


Ala 


Gly 


Pro Ala 


Gly Pro Thr Gly Pro Ala Gly 






85 








90 95 


Glu Cys Ser 


Val 


Pro 


Pro 


Arg 


Ser Ala 


Phe Ser Ala Lys Arg Ser Glu 




100 








105 


110 


Ser Arg Val 


Pro 


Pro 


Pro 


Ser 


Asp Ala 


Pro Leu Pro Phe Asp Arg Val 


115 










120 


125 


Leu Ala Asn 


Glu 


Gin 


Gly 


His 


Tyr Asp 


Ala Val Thr Gly Lys Phe Thr 


130 








135 




140 


Cys Gin Val 


Pro 


Gly 


Val 


Tyr 


Tyr Phe 


Ala Val His Ala Thr Val Tyr 


145 






150 






155 160 


Arg Ala Ser 


Leu 


Gin 


Phe 


Asp 


Leu Val 


Lys Asn Gly Glu Ser lie Ala 






165 








170 175 


Ser Phe Phe 


Gin 


Phe 


Phe 


Gly 


Gly Trp 


Pro Lys Pro Ala Ser Leu Ser 




180 








185 


190 


Gly Gly Ala 


Met 


Val 


Arg 


Leu 


Glu Pro 


Glu Asp Gin Val Trp Val Gin 


195 










200 


205 


Val Gly Val 


Gly 


Asp 


Tyr 


lie 


Gly lie 


Tyr Ala Ser lie Lys Thr Asp 


210 








215 




220 


Ser Thr Phe 


Ser 


Gly 


Phe 


Leu 


Val Tyr 


Ser Asp Trp His Ser Ser Pro 


225 






230 






235 240 



Val Phe Ala 



<210> 107 
<211> 1338 
<212> DNA 

<213> Homo sapiens 
<400> 107 

gtcgacccac gcgtccggga ctggggtgac ggcagggcag ggggcgcctg gccggggaga 60 

agcgcggggg ctggagcacc accaactgga gggtccggag tagcgagcgc cccgaaggag 120 

gccatcgggg agccgggagg ggggactgcg agaggacccc ggcgtccggg ctcccggtgc 180 

cagcgctatg aggccactcc tcgtcctgct gctcctgggc ctggcggccg gctcgccccc 240 

actggacgac aacaagatcc ccagcctctg cccggggcac cccggccttc caggcacgcc 300 

gggccaccat ggcagccagg gcttgccggg ccgcgatggc cgcgacggcc gcgacggcgc 360 

gcccggggct ccgggagaga aaggcgaggg cgggaggcgg gactgccggg acctcgaggg 420 

gaccccgggc cgcgaggaga ggcgggaccc gcggggccca ccgggcctgc cggggagtgc 4 80 

tcggtgcctc cgcgatccgc cttcagcgcc aagcgctccg agagccgggt gcctccgccg 540 

tctgacgcac ccttgccctt cgaccgcgtg ctggcgaacg agcagggaca ttacgacgcc 600 

gtcaccggca agttcacctg ccaggtgcct ggggtctact acttcgccgt ccatgccacc 660 

gtctaccggg ccagcctgca gtttgatctg gtgaagaatg gcgaatccat tgcctctttc 720 

ttccagtttt tcggggggtg gcccaagcca gcctcgctct cggggggggc catggtgagg 780 

ctggagcctg aggaccaagt gtgggtgcag gtgggtgtgg gtgactacat tggcatctat 840 

gccagcatca agacagacag caccttctcc ggatttctgg tgtactccga ctggcacagc 900 

tccccagtct ttgcttagtg cccactgcaa agtgagctca tgctctcact cctagaagga 960 

gggtgtgagg ctgacaacct ggtcatccag gagggctggc ccccctggaa tattgtgaat 1020 

gactagggag gtggggtaga gcactctccg tcctgctgct ggcaaggaat gggaacagtg 1080 

gctgtctgcg atcaggtctg gcagcatggg gcagtggctg gatttctgcc caagaccaga 1140 

ggagtgtgct gtgctggcaa gtgtaagtcc cccagttgct ctggtccagg agcccacggt 1200 

ggggtgctct cttcctggtc ctctgcttct ctggatcctc cccaccccct cctgctcctg 1260 

gggccggccc ttttctcaga gatcactcaa taaacctaag aaccctccaa aaaaaaaaaa 1320 

aaaaaaaagg gcggccgc 1338 
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<210> 108 

<211> 243 

<212> PRT 

<213> Homo sapiens 



<400> 108 



Met Arg 


Pro 


Leu 


Leu 


Val 


Leu 


Leu 


Leu 


Leu 


Gly Leu 


Ala 


Ala 


Gly 


Ser 


1 






5 










10 








15 




Pro Pro 


Leu 


Asp 


Asp 


Asn 


Lys 


lie 


Pro 


Ser 


Leu Cys 


Pro 


Gly 


His 


Pro 






20 










25 








30 






Gly Leu 


Pro 


Gly 


Thr 


Pro 


Gly 


His 


His 


Gly 


Ser Gin 


Gly 


Leu 


Pro 


Gly 




35 










40 








45 








Arg Asp 


Gly 


Arg 


Asp 


Gly 


Arg 


Asp 


Gly 


Ala 


Pro Gly 


Ala 


Pro 


Gly 


Glu 


50 










55 








60 










Lys Gly 


Glu 


Gly 


Gly 


Arg 


Pro 


Gly 


Leu 


Pro 


Gly Pro 


Arg 


Gly 


Asp 


Pro 


65 








70 










75 








80 


Gly Pro 


Arg 


Gly 


Glu 


Ala 


Gly 


Pro 


Ala 


Gly 


Pro Thr 


Gly 


Pro 


Ala 


Gly 








85 










90 








95 




Glu Cys 


Ser 


Val 


Pro 


Pro 


Arg 


Ser 


Ala 


Phe 


Ser Ala 


Lys 


Arg 


Ser 


Glu 






100 










105 








110 






Ser Arg 


Val 


Pro 


Pro 


Pro 


Ser 


Asp 


Ala 


Pro 


Leu Pro 


Phe 


Asp 


Arg 


Val 




115 










120 








125 








Leu Val 


Asn 


Glu 


Gin 


Gly 


His 


Tyr 


Asp 


Ala 


Val Thr 


Gly 


Lys 


Phe 


Thr 


130 










135 








140 










Cys Gin 


Val 


Pro 


Gly 


Val 


Tyr 


Tyr 


Phe 


Ala 


Val His 


Ala 


Thr 


Val 


Tyr 


145 








150 










155 








160 


Arg Ala 


Ser 


Leu 


Gin 


Phe 


Asp 


Leu 


Val 


Lys 


Asn Gly 


Glu 


Ser 


Leu 


Ala 








165 










170 








175 




Ser Phe 


Phe 


Gin 


Phe 


Phe 


Gly 


Gly 


Trp 


Pro 


Lys Pro 


Ala 


Ser 


Leu 


Ser 






180 










185 








190 






Gly Gly 


Ala 


Met 


Val 


Arg 


Leu 


Glu 


Pro 


Glu 


Asp Gin 


Val 


Trp 


Val 


Gin 




195 










200 








205 








Val Gly 


Val 


Gly 


Asp 


Tyr 


lie 


Gly 


lie 


Tyr 


Ala Ser 


lie 


Lys 


Thr 


Asp 


210 










215 








220 










Ser Thr 


Phe 


Ser 


Gly 


Phe 


Leu 


Val 


Tyr 


Ser 


Asp Trp 


His 


Ser 


Ser 


Pro 


225 








230 










235 








240 



Val Phe Ala 



<210> 109 

<211> 1338 

<212> DNA 

<213> Homo sapiens 

<400> 109 

gtcgacccac gcgtccggga 

agcgcggggg ctggagcacc 

gccatcgggg agccgggagg 

cagcgctatg aggccactcc 

actggacgac aacaagatcc 

gggccaccat ggcagccagg 

gcccggggct ccgggagaga 

gaccccgggc cgcgaggaga 

tcggtgcctc cgcgatccgc 

tctgacgcac ccttgccctt 

gtcaccggca agttcacctg 

gtctaccggg ccagcctgca 

ttccagtttt tcggggggtg 



ctggggtgac ggcagggcag ggggcgcctg gccggggaga 60 

accaactgga gggtccggag tagcgagcgc cccgaaggag 120 

ggggactgcg agaggacccc ggcgtccggg ctcccggtgc 180 

tcgtcctgct gctcctgggc ctggcggccg gctcgccccc 240 

ccagcctctg cccggggcac cccggccttc caggcacgcc 300 

gcttgccggg ccgcgatggc cgcgacggcc gcgacggcgc 360 

aaggcgaggg cgggaggcgg gactgccggg acctcgaggg 420 

ggcgggaccc gcggggccca ccgggcctgc cggggagtgc 4 80 

cttcagcgcc aagcgctccg agagccgggt gcctccgccg 540 

cgaccgcgtg ctggtgaacg agcagggaca ttacgacgcc 600 

ccaggtgcct ggggtctact acttcgccgt ccatgccacc 660 

gtttgatctg gtgaagaatg gcgaatccct tgcctctttc 720 

gcccaagcca gcctcgctct cggggggggc catggtgagg 780 
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ctggagcctg aggaccaagt gtgggtgcag gtgggtgtgg gtgactacat tggcatctat 84 0 

gccagcatca agacagacag caccttctcc ggatttctgg tgtactccga ctggcacagc 900 

tccccagtct ttgcttagtg cccactgcaa agtgagctca tgctctcact cctagaagga 960 

999tgtgagg ctgacaacct ggtcatccag gagggctggc ccccctggaa tattgtgaat 1020 

gactagggag gtggggtaga gcactctccg tcctgctgct ggcaaggaat gggaacagtg 1080 

gctgtctgcg atcaggtctg gcagcatggg gcagtggctg gatttctgcc caagaccaga 1140 

ggagtgtgct gtgctggcaa gtgtaagtcc cccagttgct ctggtccagg agcccacggt 1200 

ggggtgctct cttcctggtc ctctgcttct ctggatcctc cccaccccct cctgctcctg 1260 

gggccggccc ttttctcaga gatcactcaa taaacctaag aaccctccaa aaaaaaaaaa 1320 

aaaaaaaagg gcggccgc 1338 

<210> 110 
<211> 406 
<212> PRT 

<213> Homo sapiens 



<400> 110 
Met Gly Pro Ser 
1 

Gly Pro Leu Gin 
20 

Arg Leu Ala Ala 

35 

Ser Arg His Ala 
50 

Leu Leu Glu Val 
65 

Asp Thr lie Ser 

Leu Glu Thr Gin 
100 

Val Thr Gly Gly 
115 

Tyr Asp Met Val 
130 

Met Lys He Leu 
145 

Asp Pro Leu Gly 

Asn Asp Thr Ala 
180 

Met Ala Ala Arg 
195 

Gly Thr Gly Gin 
210 

Pro Pro Gly Arg 
225 

Leu He Lys Phe 

Phe Pro Ala Glu 
260 

Tyr He Asp Leu 
275 

Thr Arg Glu Asp 
290 

Thr Leu Asp Thr 
305 

Ala Glu Ala Ala 



Thr 


Pro 


Leu 


Leu 


5 








Gly 


Gin 


Gin 


His 


Leu 


Glu 


Glu 


Arg 








40 


Ala 


Glu 


Leu 


Arq 






55 




Ala 


Glu 


Lys 


Glu 




70 






Gly 


Arq 


Val 


Asr> 


85 








Asn 


Pro 


Ala 


Leu 


Pro 


Gly 


Thr 


Lvs 








120 


Thr 


Asp 


Cys 


Glv 






135 




T.uq 
J-i y a 






uiy 




150 






Gin 


Thr 


Glu 


Lys 


165 








Phe 


Val 


Phe 


Pro 


Lys 


Ala 


Ser 


Arg 








200 


Leu 


Val 


Tyr 


Gly 






215 




Pro 


Gly 


Gly 


Gly 




230 






His 


Leu 


Ala 


Asn 


245 








Gly 


Leu 


He 


Pro 


Ala 


Ala 


Asp 


Glu 








280 


Asp 


Arg 


His 


Leu 






295 




Glu 


Gin 


Gin 


Trp 




310 






Phe 


val 


He 


Cys 



He 


Leu 


Phe Leu 




10 




His 


Leu 


Val Glu 


25 






Leu 


Ala 


Gin Cys 


Aso 


Phe 


Lys Asn 






60 


Ara 


Glu 


Ala Leu 






75 


Arq 


Leu 


Glu Arg 




90 




Pro 


Cys 


Val Glu 


105 






Gly 


Lys 


Gly Arg 


Tvr 


Thr 


lit O C. X 






140 




riu 


Ala rjl \/ 
Hid VJly 






155 


He 


Tyr 


Val Leu 




170 




Arg 


Leu 


Arg Asp 


185 






Val 


Arg 


Val Pro 


Gly 


Phe 


Leu Tyr 






220 


Gly 


Glu 


Met Glu 






235 


Arg 


Thr 


Val Val 




250 




Pro 


Tyr 


Gly Leu 


265 






Glu 


Gly 


Leu Trp 


Cys 


Leu 


Ala Lys 






300 


Asp 


Thr 


Pro Cys 






315 


Gly 


Thr 


Leu Tyr 



Leu 


Ser Trp 


C q y 




15 




Tvr 




A irrt 




30 




Gin 

will 


A an Hi n 




45 






Lys 


Met Leu 


Pro 


A TCI 


1 111. U1U 


Ala 

t\JL a. 






ftO 

O w 


V7 J. W 


Ala Sen 
nl cl nop 


Tyr 








Php 

xriitz 


Aon C2. 1 11 
nop UlU 


Lys 




Tin 

llU 




Arg 


A art O T n 
M.y 11 UJ.U 


Lys 


125 






Gin 


Val Arg 


Ser 


Leu 


Trp Thr 


Lys 






low 


Asp 


Gly Thr 


Gin 




175 




Phe 


Thr Leu 


Ala 




190 




Phe 


Pro Trp 


Val 


205 






Phe 


Ala Arg 


Arg 


Asn 


Thr Leu 


Gin 






240 


Asp 


Ser Ser 


Val 




255 




Thr 


Ala Asp 


Thr 




270 




Ala 


Val Tyr 


Ala 


285 






Leu 


Asp Pro 


Gin 


Pro 


Arg Glu 


Asn 






320 


Val 


Val Tyr 


Asn 
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325 






330 




335 


Thr Arg Pro Ala 


Ser 


Arg 


Ala Arg lie 


Gin 


Cys 


Ser Phe Asp Ala Ser 


340 






345 






350 


Gly Thr Leu Thr 


Pro 


Glu 


Arg Ala Ala 


Leu 


Pro 


Tyr Phe Pro Arg Arg 


355 






360 






365 


Tyr Gly Ala His 


Ala 


Ser 


Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu 


370 






375 






380 


Tyr Ala Trp Asp 


Asp 


Gly 


Tyr Gin lie 


Val 


Tyr 


Lys Leu Glu Met Arg 


385 




390 






395 


400 


Lys Lys Glu Glu 


Glu 
405 


Val 











<210> 111 

<211> 1831 

<212> DNA 

<213> Homo sapiens 

<400> 111 

gtcgacccac gcgtccgcgg acgcgtgggt gaggggaaga ggctgactgt acgttccttc 60 

tactctggca ccactctcca ggctgccatg gggcccagca cccctctcct catcttgttc 120 

cttttgtcat ggtcgggacc cctccaagga cagcagcacc accttgtgga gtacatggaa 180 

cgccgactag ctgctttaga ggaacggctg gcccagtgcc aggaccagag tagtcggcat 240 

gctgctgagc tgcgggactt caagaacaag atgctgccac tgctggaggt ggcagagaag 300 

gagcgggagg cactcagaac tgaggccgac accatctccg ggagagtgga tcgtctggag 360 

cgggaggcag actatctgga gacccagaac ccagctctgc cctgtgtaga gtttgatgag 420 

aaggtgactg gaggccctgg gaccaaaggc aagggaagaa ggaatgagaa gtacgatatg 4 80 

gtgacagact gtggctacac aatctctcaa gtgagatcaa tgaagattct gaagcgattt 540 

ggtggcccag ctggtctatg gaccaaggat ccactggggc aaacagagaa gatctacgtg 600 

ttagatggga cacagaatga cacagccttt gtcttcccaa ggctgcgtga cttcaccctt 660 

gccatggctg cccggaaagc ttcccgagtc cgggtgccct tcccctgggt aggcacaggg 720 

cagctggtat atggtggctt tctttatttt gctcggaggc ctcctggaag acctggtgga 780 

ggtggtgaga tggagaacac tttgcagcta atcaaattcc acctggcaaa ccgaacagtg 84 0 

gtggacagct cagtattccc agcagagggg ctgatccccc cctacggctt gacagcagac 900 

acctacatcg acctggcagc tgatgaggaa ggtctttggg ctgtctatgc cacccgggag 960 

gatgacaggc acttgtgtct ggccaagtta gatccacaga cactggacac agagcagcag 1020 

tgggacacac catgtcccag agagaatgct gaggctgcct ttgtcatctg tgggaccctc 1080 

tatgtcgtct ataacacccg tcctgccagt cgggcccgca tccagtgctc ctttgatgcc 1140 

agcggcaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200 

catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatggctac 1260 

cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 

gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 

cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 

aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 

agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 

atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620 

gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 

gtccagctct gtcctctctt cctcactcct cccttcagtg tcctgaggaa caggactttc 1740 

tccacattgt tttgtattgc aacattttgc attaaaagga aaatccactg ctaaaaaaaa 1800 

aaaaaaaaaa aaaaaaaaaa agggcggccg c 1831 

<210> 112 
<211> 406 
<212> PRT 
<213> Homo sapiens 

<400> 112 

Met Gly Pro Ser Thr Pro Leu Leu lie Leu Phe Leu Leu Ser Trp Ser 
15 10 15 
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Gly 


Pro 


Leu 


Gin 


Gly 


Gin 


Gin 


His His 


Leu 


Val 


Glu 


Tyr 


Met 


Glu 


Arg 








20 








25 










30 






Arg 


Leu 


Ala 


Ala 


Leu 


Glu 


Glu 


Arg Leu 


Ala 


Gin 


Cys 


Gin 


Asp 


Gin 


Ser 






35 










40 








45 








Ser 


Arg 


His 


Ala 


Ala 


Glu 


Leu 


Arg Asp 


Phe 


Lys 


Asn 


Lys 


Met 


Leu 


Pro 




50 










55 








60 










Leu 


Leu 


Glu 


Val 


Ala 


Glu 


Lys 


Glu Arg 


Glu 


Ala 


Leu 


Arg 


Thr 


Glu 


Ala 


65 










70 








75 










80 


Asp 


Thr 


He 


Ser Gly 


Arg 


Val 


Asp Arg 


Leu 


Glu 


Arg 


Glu 


Val 


Asp 


Tyr 










85 








90 










95 




Leu 


Glu 


Thr 


Gin 


Asn 


Pro 


Ala 


Leu Pro 


Cys 


Val 


Glu 


Phe 


Asp 


Glu 


Lys 








100 








105 










110 






Val 


Thr 


Gly 


Gly Pro 


Gly 


Thr 


Lys Gly 


Lys 


Gly 


Arg 


Arg 


Asn 


Glu 


Lys 






115 










120 








125 








Tyr 


Asp 


He 


Val 


Thr 


Asp 


Cys 


Gly Tyr 


Thr 


He 


Ser 


Gin 


Val 


Arg 


Ser 




130 










135 








140 










Met 


Lys 


He 


Leu 


Lys 


Arg 


Phe 


Gly Gly 


Pro 


Ala 


Gly 


Leu 


Trp 


Thr 


Lys 


145 










150 








155 










160 


Asp 


Pro 


Leu 


Gly Gin 


Thr 


Glu 


Lys He 


Tyr 


Val 


Leu 


Asp 


Gly 


Thr 


Gin 










165 








170 










175 




Asn 


Asp 


Thr 


Ala 


Phe 


Val 


Phe 


Pro Arg 


Leu 


Arg 


Asp 


Phe 


Thr 


Leu 


Ala 








180 








185 










190 






Met 


Ala 


Ala 


Ara 


Lys 


Ala 


Ser 


Arg Val 


Arg 


Val 


Pro 


Phe 


Pro 


Trp 


Val 






195 










200 








205 








Gly 


Thr 


Gly 


Gin 


Leu 


Val 


Tyr 


Gly Gly 


Phe 


Leu 


Tyr 


Phe 


Ala 


Arg 


Arg 




210 










215 








220 










Pro 


Pro 


Gly 


Arg 


Pro 


Gly 


Gly 


Gly Gly 


Glu 


Met 


Glu 


Asn 


Thr 


Leu 


Gin 


225 










230 








235 










240 


Leu 


He 


Lys 


Phe 


His 


Leu 


Ala 


Asn Arg 


Thr 


Val 


Val 


Asp 


Ser 


Ser 


Val 










245 








250 










255 




Phe 


Pro 


Ala 


Glu 


Gly 


Leu 


lie 


Pro Pro 


Tyr 


Gly 


Leu 


Thr 


Ala 


Asp 


Thr 








260 








265 










270 






Tyr 


He 


Asp 


Leu 


Ala 


Ala 


Asp 


Glu Glu 


Gly 


Leu 


Trp 


Ala 


Val 


Tyr 


Ala 






275 










280 








285 








Thr 


Arg 


Glu 


Asp 


Asp 


Arg 


His 


Leu Cys 


Leu 


Ala 


Lys 


Leu 


Asp 


Pro 


Gin 




290 










295 








300 










Thr 


Leu 


Asp 


Thr 


Glu 


Gin 


Gin 


Trp Asp 


Thr 


Pro 


Cys 


Pro 


Arg 


Glu 


Asn 


305 










310 








315 










320 


Ala 


Glu 


Ala 


Ala 


Phe 


Val 


lie. 


Cys Gly 


Thr 


Leu 


Tyr 


Val 


Val 


Tyr 


Asn 










325 








330 










335 




Thr 


Arg 


Pro 


Ala 


Ser 


Arg 


Ala 


Arg He 


Gin 


Cys 


Ser 


Phe 


Asp 


Ala 


Ser 








340 








345 










350 






Gly 


Thr 


Leu 


Thr 


Pro 


Glu 


Arg 


Ala Ala 


Leu 


Pro 


Tyr 


Phe 


Pro 


Arg 


Arg 






355 










360 








365 








Tyr 


Gly 


Ala 


His 


Ala 


Ser 


Leu 


Arg Tyr 


Asn 


Pro 


Arg 


Glu 


Arg 


Gin 


Leu 




370 










375 








380 










Tyr 


Ala 


Trp 


Asp 


Asp 


Gly 


Tyr 


Gin He 


Val 


Tyr 


Lys 


Leu 


Glu 


Met 


Arg 


385 










390 








395 










400 


Lys 


Lys 


Glu 


Glu 


Glu 


Val 





















405 



<210> 113 

<211> 1831 

<212> DNA 

<213> Homo sapiens 

<400> 113 

gtcgacccac gcgtccgcgg acgcgtgggt gaggggaaga ggctgactgt acgttccttc 60 
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tactctggca ccactctcca ggctgccatg gggcccagca cccctctcct catcttgttc 120 

cttttgtcat ggtcgggacc cctccaagga cagcagcacc accttgtgga gtacatggaa 180 

cgccgactag ctgctttaga ggaacggctg gcccagtgcc aggaccagag tagtcggcat 24 0 

gctgctgagc tgcgggactt caagaacaag atgctgccac tgctggaggt ggcagagaag 3 00 

gagcgggagg cactcagaac tgaggccgac accatctccg ggagagtgga tcgtctggag 360 

cgggaggtag actatctgga gacccagaac ccagctctgc cctgtgtaga gtttgatgag 420 

aaggtgactg gaggccctgg gaccaaaggc aagggaagaa ggaatgagaa gtacgatata 4 80 

gtgacagact gtggctacac aatctctcaa gtgagatcaa tgaagattct gaagcgattt 540 

ggtggcccag ctggtctatg gaccaaggat ccactggggc aaacagagaa gatctacgtg 600 

ttagatggga cacagaatga cacagccttt gtcttcccaa ggctgcgtga cttcaccctt 660 

gccatggctg cccggaaagc ttcccgagtc cgggtgccct tcccctgggt aggcacaggg 720 

cagctggtat atggtggctt tctttatttt gctcggaggc ctcctggaag acctggtgga 780 

ggtggtgaga tggagaacac tttgcagcta atcaaattcc acctggcaaa ccgaacagtg 84 0 

gtggacagct cagtattccc agcagagggg ctgatccccc cctacggctt gacagcagac 900 

acctacatcg acctggcagc tgatgaggaa ggtctttggg ctgtctatgc cacccgggag 960 

gatgacaggc acttgtgtct ggccaagtta gatccacaga cactggacac agagcagcag 1020 

tgggacacac catgtcccag agagaatgct gaggctgcct ttgtcatctg tgggaccctc 1080 

tatgtcgtct ataacacccg tcctgccagt cgggcccgca tccagtgctc ctttgatgcc 1140 

agcggcaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200 

catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatggctac 1260 

cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 

gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 
cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc * 1440 

aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 

agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 

atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620 

gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 

gtccagctct gtcctctctt cctcactcct cccttcagtg tcctgaggaa caggactttc 1740 

tccacattgt tttgtattgc aacattttgc attaaaagga aaatccactg ctaaaaaaaa 1800 

aaaaaaaaaa aaaaaaaaaa agggcggccg c 1831 

<210> 114 
<211> 406 
<212> PRT 

<213> Homo sapiens 



<400> 114 



Met Gly 


Pro 


Ser 


Thr Pro Leu Leu 


He Leu 


Phe Leu 


Leu 


Ser 


Trp 


Ser 


1 






5 


10 








15 




Gly Pro 


Leu 


Gin 


Gly Gin Gin His 


His Leu 


Val Glu 


Tyr 


Met 


Glu 


Arg 






20 




25 






30 






Arg Leu 


Ala 


Ala 


Leu Glu Glu Arg 


Leu Ala 


Gin Cys 


Gin 


Asp 


Gin 


Ser 




35 




40 






45 








Ser Arg 


His 


Ala 


Ala Glu Leu Arg 


Asp Phe 


Lys Asn 


Lys 


Met 


Leu 


Pro 


50 






55 




60 










Leu Leu 


Glu 


Val 


Ala Glu Lys Glu 


Arg Glu 


Ala Leu 


Arg 


Thr 


Glu 


Ala 
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70 




75 








80 


Asp Thr 


He 


Ser 


Gly Arg Val Asp 


Arg Leu 


Glu Arg 


Glu 


Val 


Asp 


Tyr 








85 


90 








95 




Leu Glu 


Thr 


Gin 


Asn Pro Ala Leu 


Pro Cys 


Val Glu 


Phe 


Asp 


Glu 


Lys 






100 




105 






110 






Val Thr 


Gly 


Gly 


Pro Gly Thr Lys 


Gly Lys 


Gly Arg 


Arg 


Asn 


Glu 


Lys 




115 




120 






125 








Tyr Asp 


Met 


Val 


Thr Asp Cys Gly 


Tyr Thr 


He Ser 


Gin 


Val 


Arg 


Ser 


130 






135 




140 










Met Lys 


He 


Leu 


Lys Arg Phe Gly 


Gly Pro 


Ala Gly 


He 


Trp 


Thr 


Lys 


145 






150 




155 








160 


Asp Pro 


Leu 


Gly 


Gin Thr Glu Lys 


He Tyr 


Val Leu 


Asp 


Gly 


Thr 


Gin 
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165 








170 






175 


Asn 


Asp Thr Ala 


Phe 


Val 


Phe 


Pro Arg 


Leu 


Arg Asp 


Phe Thr 


Leu Ala 




180 








185 






190 




Met 


Ala Ala Arg 


Lys 


Ala 


Ser 


Arg Val 


Arg 


Val Pro 


Phe Pro 


Trp Val 




195 








200 






205 




Gly 


Thr Gly Gin 


Leu 


Val 


Tyr 


Gly Gly 


Phe 


Leu Tyr 


Phe Ala 


Arg Arg 




210 






215 






220 






Pro 


Pro Gly Arg 


Pro 


Gly 


Gly 


Gly Gly 


Glu 


Met Glu 


Asn Thr 


Leu Gin 


225 






230 








235 




240 


Leu 


lie Lys Phe 


His 


Leu 


Ala 


Asn Arg 


Thr 


Val Val 


Asp Ser 


Ser Val 






245 








250 






255 


Phe 


Pro Ala Glu 


Gly 


Leu 


He 


Pro Pro 


Tyr 


Gly Leu 


Thr Ala 


Asp Thr 




260 








265 






270 


Tyr 


lie Asp Leu 


Ala 


Ala 


Asp 


Glu Glu 


Gly 


Leu Trp Ala Val 


Tyr Ala 




275 








280 






285 




Thr 


Arg Glu Asp 


Asp 


Arg 


His 


Leu Cys 


Leu 


Ala Lys 


Leu Asp Pro Gin 




290 






295 






300 






Thr 


Leu Asp Thr 


Glu 


Gin 


Gin 


Trp Asp 


Thr 


Pro Cys 


Pro Arg Glu Asn 


305 






310 








315 




320 


Ala 


Glu Ala Ala 


Phe 


Val 


He 


Cys Gly 


Thr 


Leu Tyr 


Val Val 


Tyr Asn 






325 








330 






335 


Thr 


Atg Pro Ala 


Ser 


Arg 


Ala 


Arg He 


Gin 


Cys Ser 


Phe Asp 


Ala Ser 




340 








345 






350 




Gly 


Thr Leu Thr 


Pro 


Glu 


Arg 


Ala Ala 


Leu 


Pro Tyr 


Phe Pro 


Arg Arg 




355 








360 






365 


Tyr 


Gly Ala His 


Ala 


Ser 


Leu 


Arg Tyr 


Asn 


Pro Arg Glu Arg Gin Leu 




370 






375 






380 






Tyr 


Ala Trp Asp 


Asp 


Gly 


Tyr 


Gin He 


Val 


Tyr Lys 


Leu Glu 


Met Arg 


385 






390 








395 




400 


Lys 


Lys Glu Glu 


Glu 


Val 


















405 

















<210> 115 

<211> 1831 

<212> DNA 

<213> Homo sapiens 

<400> 115 
gtcgacccac gcgtccgcgg acgcgtgggt 
tactctggca ccactctcca ggctgccatg 
cttttgtcat ggtcgggacc cctccaagga 
cgccgactag ctgctttaga ggaacggctg 
gctgctgagc tgcgggactt caagaacaag 
gagcgggagg cactcagaac tgaggccgac 
cgggaggtag actatctgga gacccagaac 
aaggtgactg gaggccctgg gaccaaaggc 
gtgacagact gtggctacac aatctctcaa 
ggtggcccag ctggtatatg gaccaaggat 
ttagatggga cacagaatga cacagccttt 
gccatggctg cccggaaagc ttcccgagtc 
cagctggtat atggtggctt tctttatttt 
ggtggtgaga tggagaacac tttgcagcta 
gtggacagct cagtattccc agcagagggg 
acctacatcg acctggcagc tgatgaggaa 
gatgacaggc acttgtgtct ggccaagtta 
tgggacacac catgtcccag agagaatgct 
tatgtcgtct ataacacccg tcctgccagt 
agcggcaccc tgacccctga acgggcagca 



gaggggaaga ggctgactgt 
gggcccagca cccctctcct 
cagcagcacc accttgtgga 
gcccagtgcc aggaccagag 
atgctgccac tgctggaggt 
accatctccg ggagagtgga 
ccagctctgc cctgtgtaga 
aagggaagaa ggaatgagaa 
gtgagatcaa tgaagattct 
ccactggggc aaacagagaa 
gtcttcccaa ggctgcgtga 
cgggtgccct tcccctgggt 
gctcggaggc ctcctggaag 
atcaaattcc acctggcaaa 
ctgatccccc cctacggctt 
ggtctttggg ctgtctatgc 
gatccacaga cactggacac 
gaggctgcct ttgtcatctg 
cgggcccgca tccagtgctc 
ctcccttatt ttccccgcag 



acgttccttc 60 

catcttgttc 120 

gtacatggaa 180 

tagtcggcat 240 

ggcagagaag 300 

tcgtctggag 360 

gtttgatgag 420 

gtacgatatg 480 

gaagcgattt 540 

gatctacgtg 600 

cttcaccctt 660 

aggcacaggg 720 

acctggtgga 780 

ccgaacagtg 84 0 

gacagcagac 900 

cacccgggag 960 

agagcagcag 1020 

tgggaccctc 1080 

ctttgatgcc 114 0 

atatggtgcc 1200 
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catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatggctac 1260 

cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 

gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 

cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 

aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 

agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 

atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620 

gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 

gtccagctct gtcctctctt cctcactcct cccttcagtg tcctgaggaa caggactttc 1740 

tccacattgt tttgtattgc aacattttgc attaaaagga aaatccactg ctaaaaaaaa 1800 

aaaaaaaaaa aaaaaaaaaa agggcggccg c 1831 

<210> 116 
<211> 406 
<212> PRT 

<213> Homo sapiens 



<400> 116 



Met Gly Pro 


Ser 


Thr 


Pro 


Leu 


Leu 


He 


Leu 


Phe Tien 


XJC VI 


Ser 


Trp 


Cor 
OCX 


1 














1 0 
X u 








15 




Glv Pro Leu 


Gin 


Glv 


Gin 


Gin 


His 


His 


XIC u 


Val Gin 
vdx vjiu 


iyr 


Met 


Glu 


Arg 




20 










& 3 








30 




Arcr Leu Ala 


Ala 


Leu 


Glu 


Glu 


Ara 


Tipu 

U<3 U. 


Ala 


Gl T\ PvQ 


Gl n 

V71X1 


Asp Gin 


Ser 


J -3 










d 0 








A c 








Ser Ara His 


Ala 


Ala 


Glu 


f iPii 




A e f\ 


Jr lie 




Lys 


Met 


Leu 


Pro 


















o u 










Leu Leu Glu 


Val 


Ala 


Glu 

V7X Ul 




Gl it 


Arn 

Arg 


Gl u 


Ala T.on 


Arg 


Thr 


Glu 


Aia 


















/ D 








80 


Asd Thr lie 


Ser 


Glv 


Arcr 


Val 

v a x 




Mxg 


i-lC u. 


(^1 ii Zirr? 
VJlU niy 


Gl ii 


Val 


Asp 


Tyr 






QC 
O -? 


















95 




Leu Glu Thr 

iJt» U WJiU 1111 


Gin 


Aan 


XT X \J 


Al a 
rvx ct 


T ,oi i 
Xjc U 


Dr^% 


Cys 


Vol ulU 


rue 


Asp 


Glu 


Lys 




100 

IVv 










1UD 








110 






Val Thr Glv 

v a. x iiil uijr 


Glv 


Pro 


Glv 
J 


X I1X 


xjy o 


Gl v 


Lys 


vjiy M.ry 


Arg 


Asn 


Glu 


T irn 

Liys 


lie: 
113 










1 nn 

lz U 








125 








T\/r lien Mot"* 


v a± 


Thr 


MS p 


cys 


oxy 


Tyr 


inr 


lie oer 


oin 


Val 


Arg 


Ser 


1JU 








lob 








140 










Met Lys lie 


Leu 


Lys 


Arg 


Phe 


Gly 


Gly 


Pro 


Ala Gly 


Leu 


Trp 


Thr 


Lys 


145 






150 










155 








160 


Asp Pro Leu 


Gly 


Gin 


Thr 


Glu 


Lys 


lie 


Tyr 


Val Leu 


Asp 


Gly Thr 


Gin 






165 










170 








175 




Asn Asp Thr 


Val 


Phe 


Val 


Phe 


Pro 


Arg 


Leu 


Arg Asp 


Phe 


Thr 


Leu 


Ala 




180 










185 








190 






Met Ala Ala 


Arg 


Lys 


Ala 


Ser 


Arg 


Val 


Arg 


Val Pro 


Phe 


Pro Trp 


Val 


195 










200 








205 








Gly Thr Gly 


Gin 


Leu 


Val 


Tyr 


Gly 


Gly 


Phe 


Leu Tyr 


Phe 


Ala Arg 


Arg 


210 








215 








220 










Pro Pro Gly 


Arg 


Pro 


Gly 


Gly 


Gly 


Gly 


Glu 


Met Glu 


Asn 


Thr 


Leu 


Gin 


225 






230 










235 








240 


Leu lie Lys 


Phe 


His 


Leu 


Ala 


Asn 


Arg 


Thr 


Val Val 


Asp 


Ser 


Ser 


Val 






245 










250 








255 




Phe Pro Ala 


Glu 


Gly 


Leu 


He 


Pro 


Pro 


Tyr 


Gly Leu 


Thr 


Ala 


Asp 


Thr 




260 










265 








270 






Tyr lie Asp 


Leu 


Ala 


Ala 


Asp 


Glu 


Glu 


Gly 


Leu Trp 


Ala 


Val 


Tyr 


Ala 


275 










280 








285 








Thr Arg Glu 


Asp 


Asp 


Arg 


His 


Leu 


Cys 


Leu 


Ala Lys 


Leu 


Asp 


Pro 


Gin 


290 








295 








300 










Thr Leu Asp 


Thr 


Glu 


Gin 


Gin 


Trp 


Asp 


Thr 


Pro Cys 


Pro 


Arg 


Glu 


Asn 


305 






310 










315 








320 
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Ala Glu Ala Ala Phe Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn 

325 330 335 

Thr Arg Pro Ala Ser Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser 

340 345 350 

Gly Thr Leu Thr Pro Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg 

355 360 * 365 

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu 

370 375 380 

Tyr Ala Trp Asp Asp Gly Tyr Gin He Val Tyr Lys Leu Glu Met Arg 
385 390 395 400 

Lys Lys Glu Glu Glu Val 
405 

<210> 117 

<211> 1831 

<212> DNA 

<213> Homo sapiens 

<400> 117 

gtcgacccac gcgtccgcgg acgcgtgggt gaggggaaga ggctgactgt acgttccttc 60 

tactctggca ccactctcca ggctgccatg gggcccagca cccctctcct catcttgttc 120 

cttttgtcat ggtcgggacc cctccaagga cagcagcacc accttgtgga gtacatggaa 180 

cgccgactag ctgctttaga ggaacggctg gcccagtgcc aggaccagag tagtcggcat 240 

gctgctgagc tgcgggactt caagaacaag atgctgccac tgctggaggt ggcagagaag 3 00 

gagcgggagg cactcagaac tgaggccgac accatctccg ggagagtgga tcgtctggag 360 

cgggaggtag actatctgga gacccagaac ccagctctgc cctgtgtaga gtttgatgag 420 

aaggtgactg gaggccctgg gaccaaaggc aagggaagaa ggaatgagaa gtacgatatg 4 80 

gtgacagact gtggctacac aatctctcaa gtgagatcaa tgaagattct gaagcgattt 540 

ggtggcccag ctggtctatg gaccaaggat ccactggggc aaacagagaa gatctacgtg 600 

ttagatggga cacagaatga cacagtcttt gtcttcccaa ggctgcgtga cttcaccctt 660 

gccatggctg cccggaaagc ttcccgagtc cgggtgccct tcccctgggt aggcacaggg 720 

cagctggtat atggtggctt tctttatttt gctcggaggc ctcctggaag acctggtgga 780 

ggtggtgaga tggagaacac tttgcagcta atcaaattcc acctggcaaa ccgaacagtg 840 

gtggacagct cagtattccc agcagagggg ctgatccccc cctacggctt gacagcagac 900 

acctacatcg acctggcagc tgatgaggaa ggtctttggg ctgtctatgc cacccgggag 960 

gatgacaggc acttgtgtct ggccaagtta gatccacaga cactggacac agagcagcag 1020 

tgggacacac catgtcccag agagaatgct gaggctgcct ttgtcatctg tgggaccctc 1080 

tatgtcgtct ataacacccg tcctgccagt cgggcccgca tccagtgctc ctttgatgcc 114 0 

agcggcaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200 

catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatggctac 1260 

cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 

gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 

cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 

aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 

agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 

atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620 

gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 

gtccagctct gtcctctctt cctcactcct cccttcagtg tcctgaggaa caggactttc 1740 

tccacattgt tttgtattgc aacattttgc attaaaagga aaatccactg ctaaaaaaaa 1800 

aaaaaaaaaa aaaaaaaaaa agggcggccg c 1831 

<210> 118 
<211> 242 
<212> PRT 

<213> Mus musculus 
<400> 118 

Met Ar.g Pro Leu Leu. Ala Leu Leu Leu Leu Gly Leu val Ser Gly Ser 
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1 

Pro Pro Leu Asp 
20 

Gly Leu Pro Gly 
35 

Arg Asp Gly Arg 
50 

Lys Gly Glu Gly 
65 

Gly Pro Arg Gly 

Cys Ser Val Pro 
100 

Arg Val Pro Pro 
115 

Leu Asn Glu Gin 
130 

Gin Val Pro Gly 
145 

Ala Ser Leu Gin 

Phe Phe Gin Tyr 
180 

Gly Ala Met Val 
195 

Gly Val Gly Asp 
210 

Thr Phe Ser Gly 
225 

Phe Ala 



A q*n Aon 
nbp ASH 


Lys 


lie 


inr r*iO 




TT -J — 

HIS 






A ft 


HSp oiy 


Arg 


ASp 




55 




Gly Arg 


Pro 


Gly 


70 






olU \j±y 


Pro 


wee 


85 






Pro Arg 


Ser 


Ala 


Fro Ala 


ASp 


Tnr 








uiy nis 


Tyr 


Asp 




135 




Val Tyr 


Tyr 


Phe 


150 






Pho A on 


UC LI 


vol 


165 






XrllC Vaxy 




Trp 


Arg Leu 


GlU 


Pro 






200 


Tyr lie 


Gly 


He 




215 




Phe Leu 


Val 


Tyr 


230 









10 




Pro 


Ser 


Leu Cys 


o c 

25 






HIS 


Gly 


Ser Gin 


/"•l Si- 
cily 


Aia 


Pro Gly 






60 


Leu 


Pro 


Gly Pro 






75 


Gly 


Ala 


He Gly 




90 




Pne 


Ser 


Ala Lys 


105 






Pro 


Leu 


Pro Phe 


Pro 


rpV| v> 


xnr Gly 






140 


Aid 


vai 


his Aia 






ICC 

155 


Lys 


Asn 


Gly Gin 




170 




Pro 


Lys 


Pro Ala 


IOC 

185 






ijlU 


Asp 


Gin val 


Tyr 


Ala 


Ser. He 






220 


Ser 


Asp 


Trp His 




235 



15 

Pro Gly Gin Pro 
30 

Gly Leu Pro Gly 
45 

Ala Pro Gly Glu 

Arg Gly Glu Pro 
80 

Pro Ala Gly Glu 
95 

Arg Ser Glu Ser 
110 

Asp Arg Val Leu 
125 

Lys Phe Thr Cys 

Thr Val Tyr Arg 
160 

Ser He Ala Ser 
175 

Ser Leu Ser Gly 
190 

Trp Val Gin Val 
205 

Lys Thr Asp Ser 

Ser Ser Pro Val 
240 



<210> 119 
<211> 1263 
<212> DNA 

<213> Mus mus cuius 
<400> 119 

gtcgacccac gcgtccgcgc tgtgaagcca gcaaggagca accagaagct aggagtcagt 60 

cagcaaggac aggggctgee tgectacaga ctacaagaga ggttcctgga gtctgagect 120 

ccggggtcac caccatgagg ccacttcttg cccttctgct tctgggtctg gtgtcaggct 180 

ctcctcctct ggacgacaac aagatcccca gcctgtgtcc cgggcagccc ggccttccag 240 

gcacaccagg t caeca tggc agccaaggcc tgcctggccg -tgacggccgt gatggccgcg 300 

acggtgcacc cggagctccg ggagagaaag gegagggegg gagaceggga ctacctggcc 360 

cacgtgggga gcccgggccg cgtggagagg tagggeccat gggggctatc gggectgegg 420 

gggagtgctc ggtaccccca cgatcagcct tcagtgccaa gcgatccgag ageegggtae 480 

ctccgccagc cgacacaccc ctacctttcg accgtgtgct gctaaatgag cagggecatt 540 

acgaccccac tactggcaag ttcacctgcc aagtgcctgg cgtctactac tttgctgtgc 600 

acgccactgt ctaccgggcc agettgeagt ttgatcttgt caaaaaeggg cagtccatcg 660 

cctctttctt ccagtatttt ggggggtggc ccaagccagc ctcgctctca gggggtgcga 720 

tggtaaggct agaacctgag gaccaggtgt gggtgcaggt gggcgtgggt gattacattg 780 

geatctatge cagcatcaag acagacagta ccttctctgg atttctegtc tattctgact 840 

ggcacagctc cccagtcttc gcttaaaaca cagtgaaccc ggagctggca cttgctcctc 900 

agtggagggt gtgacactaa cccgcgcagc gcataccagg agggctggee ccctggaata 960 

ttgtgaatga cttaggaaga gagggageca cttccagtcc cactgctggc aatgaatgga 1020 

gacaggctgt ctgaggtcaa gacagcgtgg agcagtggct gggtttctgc ccaggacttt 1080 

agaatgeagt aggctggcag ctgtgggtcc tggcccagga ctccaaggtg ggatgetcca 1140 

ttcctagtcc tgtgtcccct ctaggtccct gactccatct ctgctgctcc cagggcaggc 1200 
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ctttttctca gaggtcactt aataaaccta aaatcctcaa aaaaaaaaaa aaagggcggc 1260 
C 9 C " 1263 



<210> 120 

<211> 243 

<212> PRT 

<213> Mus musculus 



<400> 120 










Met Arg Pro 


Leu 


Leu Ala 


Leu 


Leu 


1 




5 






Pro Pro Leu 


Asp 


Asp Asn 


Lys 


He 




20 








Gly Leu Pro 


Gly 


Thr Pro 


Gly 


His 


35 








40 


Arg Asp Gly 


Arg 


Asp Gly 


Arg 


Asp 


50 






55 




Lys Gly Glu 


Gly 


Gly Arg 


Pro 


Gly 


65 




70 






Gly Pro Arg 


Gly 


Glu Ala 


Gly 


Pro 






85 






Glu Cys Ser 


Val 


Pro Pro 


Arg 


Ser 




100 








Ser Arg Val 


Pro 


Pro Pro 


Ala 


Asp 


115 








120 


Leu Leu Asn 


Glu 


Gin Gly 


His 


Tyr 


130 






135 




Cys Gin Val 


Pro 


Gly Val 


Tyr 


Tyr 


145 




150 






Arg Ala Ser 


Leu 


Gin Phe 


Asp 


Leu 






165 






Ser Phe Phe 


Gin 


Tyr Phe 


Gly 


Gly 




180 








Gly Gly Ala 


Met 


Val Arg 


Leu 


Glu 


195 








200 


Val Gly Val 


Gly 


Asp Tyr 


He 


Gly 


210 






215 




Ser Thr Phe 


Ser 


Gly Phe 


Leu 


Val 


225 




230 






Val Phe Ala 











Leu 


Leu 


Gly Leu Val 


Ser Gly Ser 




10 




15 


Pro 


Ser 


Leu Cys Pro 


Gly Gin Pro 


25 






30 


His 


Gly 


Ser Gin Gly 


Leu Pro Gly 






45 




Gly 


Ala 


Pro Gly Ala 


Pro Gly Glu 






60 




Leu 


Pro 


Gly Pro Arg 


Gly Glu Pro 






75 


80 


Met 


Gly 


Ala He Gly 


Pro Ala Gly 




90 




95 


Ala 


Phe 


Ser Val Lys 


Arg Ser Glu 


105 






110 


Thr 


Pro 


Leu Pro Phe 


Asp Arg Val 






125 




Asp 


Pro 


Thr Thr Gly 


Lys Phe Thr 






140 




Phe 


Ala 


Val His Ala 


Thr Val Tyr 






155 


160 


Val 


Lys 


Asn Gly Gin 


Ser He Ala 




170 




175 


Trp 


Pro 


Lys Pro Ala 


Ser Leu Ser 


185 






190 


Pro 


Glu 


Asp Gin Val 


Trp Val Gin 






205 




He 


Tyr 


Ala Ser He 


Lys Thr Asp 






220 




Tyr 


Ser 


Asp Trp His 


Ser Ser Pro 






235 


240 



<210> 121 
<211> 1263 
<212> DNA 

<213> Mus musculus 
<400> 121 

gtcgacccac gcgtccgcgc tgtgaagcca gcaaggagca accagaagct aggagtcagt 60 

cagcaaggac aggggctgcc tgcctacaga ctacaagaga ggttcctgga gtctgagcct 120 

ccggggtcac caccatgagg ccacttcttg cccttctgct tctgggtctg gtgtcaggct 180 

ctcctcctct ggacgacaac aagatcccca gcctgtgtcc cgggcagccc ggccttccag 240 

gcacaccagg tcaccatggc agccaaggcc tgcctggccg tgacggccgt gatggccgcg 300 

acggtgcacc cggagctccg ggagagaaag gcgagggcgg gagaccggga ctacctggcc 360 

cacgtgggga gcccgggccg cgtggagagg cagggcccat gggggctatc gggcctgcgg 420 

gggagtgctc ggtaccccca cgatcagtct tcagtgccaa gcgatccgag agccgggtac 480 

ctccgccagc cgacacaccc ctacctttcg accgtgtgct gctaaatgag cagggccatt 540 

acgaccccac tactggcaag ttcacctgcc aagtgcctgg cgtctactac tttgctgtgc 600 
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acgccactgt ctaccgggcc agcttgcagt ttgatcttgt caaaaacggg cagtccatcg 660 

cctctttctt ccagtatttt ggggggtggc ccaagccagc ctcgctctca gggggtgcga 720 

tggtaaggct agaacctgag gaccaggtgt gggtgcaggt gggcgtgggt gattacattg 780 

gcatctatgc cagcatcaag acagacagta ccttctctgg atttctcgtc tattctgact 840 

ggcacagctc cccagtcttc gcttaaaaca cagtgaaccc ggagctggca cttgctcctc 900 

agtggagggt gtgacactaa cccgcgcagc gcataccagg agggctggcc ccctggaata 960 

ttgtgaatga cttaggaaga gagggagcca cttccagtcc cactgctggc aatgaatgga 1020 

gacaggctgt ctgaggtcaa gacagcgtgg agcagtggct gggtttctgc ccaggacttt 1080 

agaatgcagt aggctggcag ctgtgggtcc tggcccagga ctccaaggtg ggatgctcca 1140 

ttcctagtcc tgtgtcccct ctaggtccct gactccatct ctgctgctcc cagggcaggc 12 00 

ctttttctca gaggtcactt aataaaccta aaatcctcaa aaaaaaaaaa aaagggcggc 1260 
cgc _ • 

<210> 122 
<211> 243 
<212> PRT 

<213> Mus musculus 



<400> 122 



Met Arg Pro 


Leu 


Leu 


Ala 


Leu 


Leu 


Leu 


Leu 


Glv 

oxy 


Leu 


Val 


Cpr 


wrxy 


OC JL 


1 




5 










10 










15 




Pro Pro Leu 


Asp 


Asp 


Asn 


Lys 


He 


tr A. \J 


wCJ. 






rlU 


JLy 




Ova 




20 










25 
















Glv Tit^u Pr*rt 


Glv 


Thr 


nu 


vj j. y 


His 




vj J»y 


OCX 




o JLy 


Leu 


Pro 


vj±y 


35 










40 






















7\ en 




Arg 


Ben 
Map 


Va J.y 


Aid 


Pro 




Aia 


Pro 


\jjLy 


vjxU 










5 5 










60 










Lys Gly Glu 


Gly 


Gly 


Arg 


Pro 


Gly 


Leu 


Pro 


Gly 


Pro 


Arg 


Gly 


Glu 


Pro 


65 






70 










75 










80 


Gly Pro Arg 


Gly 


Glu 


Ala 


Gly 


Pro 


Met 


Gly 


Ala 


He 


Gly 


Pro 


Ala 


Gly 






85 










90 










95 




Glu Cys Ser 


Val 


Pro 


Pro 


Arg 


Ser 


Ala 


Phe 


Ser 


Ala 


Lys 


Arg 


Ser 


Glu 




100 










105 










110 






Ser Arg Val 


Pro 


Pro 


Pro 


Ala 


Asp 


Thr 


Pro 


Leu 


Pro 


Phe 


Asp 


Arg 


Ala 


115 










120 










125 








Leu Leu Asn 


Glu 


Gin 


Gly 


His 


Tyr 


Asp 


Pro 


Thr 


thr 


Gly 


Lys 


Phe 


Thr 


130 








135 










140 










Cys Gin val 


Pro 


Gly 


Val 


Tyr 


Tyr 


Phe 


Ala 


Val 


His 


Ala 


Thr 


Val 


Tyr 


145 






150 










155 










160 


Arg Ala Ser 


Leu 


Gin 


Phe 


Asp 


Leu 


Val 


Lys 


Asn 


Gly 


Gin 


Ser 


He 


Ala 






165 










170 










175 




Ser Phe Phe 


Gin 


Tyr 


Phe 


Gly 


Gly 


Trp 


Pro 


Lys 


Pro 


Ala 


Ser 


Leu 


Ser 




180 










185 










190 






Gly Gly Ala 


Met 


Val 


Arg 


Leu 


Glu 


Pro 


Glu 


Asp 


Gin 


Val 


Trp 


Val 


Gin 


195 










200 










205 








Val Gly Val 


Gly 


Asp 


Tyr 


He 


Gly 


He 


Tyr 


Ala 


Ser 


He 


Lys 


Thr 


Asp 


210 








215 










220 










Ser Thr Phe 


Ser 


Gly 


Phe 


Leu 


Val 


Tyr 


Ser 


Asp 


Trp 


His 


Ser 


Ser 


Pro 


225 






230 










235 










240 


Val Phe Ala 





























210> 123 
211> 1263 
212> DNA 

213> Mus musculus 
400> 123 
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gtcgacccac gcgtccgcgc tgtgaagcca gcaaggagca accagaagct aggagtcagt 60 

cagcaaggac aggggctgcc tgcctacaga' ctacaagaga ggttcctgga gtctgagcct 120 

ccggggtcac caccatgagg ccacttcttg cccttctgct tctgggtctg gtgtcaggct 180 

ctcctcctct ggacgacaac aagatcccca gcctgtgtcc cgggcagccc ggccttccag 240 

gcacaccagg tcaccatggc agccaaggcc tgcctggccg tgacggccgt gatggccgcg 300 

acggtgcacc cggagctccg ggagagaaag gcgagggcgg gagaccggga ctacctggcc 360 

cacgtgggga gcccgggccg cgtggagagg cagggcccat gggggctatc gggcctgcgg 420 

gggagtgctc ggtaccccca cgatcagcct tcagtgccaa gcgatccgag agccgggtac 4 80 

ctccgccagc cgacacaccc ctacctttcg accgtgcgct gctaaatgag cagggccatt 54 0 

acgaccccac tactggcaag ttcacctgcc aagtgcctgg cgtctactac tttgctgtgc 600 

acgccactgt ctaccgggcc agcttgcagt ttgatcttgt caaaaacggg cagtccatcg 660 

cctctttctt ccagtatttt ggggggtggc ccaagccagc ctcgctctca gggggtgcga 720 

tggtaaggct agaacctgag gaccaggtgt gggtgcaggt gggcgtgggt gattacattg 780 

gcatctatgc cagcatcaag acagacagta ccttctctgg atttctcgtc tattctgact 840 

ggcacagctc cccagtcttc gcttaaaaca cagtgaaccc ggagctggca cttgctcctc 900 

agtggagggt gtgacactaa cccgcgcagc gcataccagg agggctggcc ccctggaata 960 

ttgtgaatga cttaggaaga gagggagcca cttccagtcc cactgctggc aatgaatgga 1020 

gacaggctgt ctgaggtcaa gacagcgtgg agcagtggct gggtttctgc ccaggacttt 1080 

agaatgcagt aggctggcag ctgtgggtcc tggcccagga ctccaaggtg ggatgctcca 1140 

ttcctagtcc tgtgtcccct ctaggtccct gactccatct ctgctgctcc cagggcaggc 1200 

ctttttctca gaggtcactt aataaaccta aaatcctcaa aaaaaaaaaa aaagggcggc 1260 

C 9 C 1263 

<210> 124 
<211> 243 
<212> PRT 

<213> Mus musculus 



<400> 124 



Met Arg 


Pro 


Leu 


Leu 


Ala 


Leu 


Leu 


Leu 


Leu 


Gly Leu 


Val 


Ser Gly 


Ser 


1 






5 










10 








15 




Pro Pro 


Leu 


Asp 


Asp 


Asn 


Lys 


lie 


Pro 


Ser 


Leu Cys 


Pro 


Gly Gin 


Pro 






20 










25 








30 






Gly Leu 


Pro 


Gly 


Thr 


Pro 


Gly 


His 


His 


Gly 


Ser Gin 


Gly 


Leu 


Pro 


Gly 




35 










40 








45 






Arg Asp 


Gly 


Arg 


Asp 


Gly 


Arg 


Asp 


Gly 


Ala 


Pro Gly 


Ala 


Pro 


Gly 


Glu 


50 










55 








60 








Lys Gly 


Glu 


Gly 


Gly 


Arg 


Pro 


Gly 


Leu 


Pro 


Gly Pro 


Arg 


Gly Glu 


Pro 


65 








70 










75 








80 


Gly Pro 


Arg 


Gly 


Glu 


Ala 


Gly 


Pro 


Met 


Gly 


Ala He 


Gly 


Pro 


Ala 


Gly 








85 










90 








95 




Glu Cys 


Ser 


Val 


Pro 


Pro 


Arg 


Ser 


Ala 


Phe 


Ser Ala 


Lys 


Arg 


Ser 


Glu 






100 










105 








110 






Ser Arg 


Val 


Pro 


Pro 


Pro 


Ala 


Asp 


Thr 


Pro 


Leu Pro 


Phe 


Asp Arg 


Val 




115 










120 








125 








Leu Leu 


Asn 


Glu 


Gin 


Gly 


His 


Tyr 


Asp 


Pro 


Thr Thr 


Gly 


Lys 


Phe 


Thr 


130 










135 








140 










Cys Gin 


Val 


Pro 


Gly 


Val 


Tyr 


Tyr 


Phe 


Ala 


Val His 


Ala 


Thr 


Val 


Tyr 


145 








150 










155 








160 


Arg Aid 


Ser 


Leu 


Gin 


Phe 


Asp 


He 


Val 


Lys 


Asn Gly 


Gin 


Ser 


He 


Ala 








165 










170 








175 




Ser Phe 


Phe 


Gin 


Tyr 


Phe 


Gly 


Gly 


Trp 


Pro 


Lys Pro 


Ala 


Ser 


Leu 


Ser 






180 










185 








190 






Gly Gly 


Ala 


Met 


Val 


Arg 


Leu 


Glu 


Pro 


Glu 


Asp Gin 


Val 


Trp 


Val 


Gin 




195 










200 








205 








Val Gly 


Val 


Gly 


Asp 


Tyr 


lie 


Gly 


lie 


Tyr 


Ala Ser 


lie 


Lys 


Thr 


Asp 


210 










215 








220 








Ser Thr 


Phe 


Ser 


Gly 


Phe 


Leu 


Val 


Tyr 


Ser 


Asp Trp 


His 


Ser 


Ser 


Pro 
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225 230 235 240 

Val Phe Ala 



<210> 125 

<211> 1263 

<212> DNA 

<213> Mus musculus 

<400> 125 

gtcgacccac gcgtccgcgc tgtgaagcca gcaaggagca accagaagct aggagtcagt 60 

cagcaaggac aggggctgcc tgcctacaga ctacaagaga ggttcctgga gtctgagcct 120 

ccggggtcac caccatgagg ccacttcttg cccttctgct tctgggtctg gtgtcaggct 180 

ctcctcctct ggacgacaac aagatcccca gcctgtgtcc cgggcagccc ggccttccag 240 

gcacaccagg tcaccatggc agccaaggcc tgcctggccg tgacggccgt gatggccgcg 300 

acggtgcacc cggagctccg ggagagaaag gcgagggcgg gagaccggga ctacctggcc 360 

cacgtgggga gcccgggccg cgtggagagg cagggcccat gggggctatc gggcctgcgg 420 

gggagtgctc ggtaccccca cgatcagcct tcagtgccaa gcgatccgag agccgggtac 480 

ctccgccagc cgacacaccc ctacctttcg accgtgtgct gctaaatgag cagggccatt 540 

acgaccccac tactggcaag ttcacctgcc aagtgcctgg cgtctactac tttgctgtgc 600 

acgccactgt ctaccgggcc agcttgcagt ttgatattgt caaaaacggg cagtccatcg 660 

cctctttctt ccagtatttt ggggggtggc ccaagccagc ctcgctctca gggggtgcga 720 

tggtaaggct agaacctgag gaccaggtgt gggtgcaggt gggcgtgggt gattacattg 780 

gcatctatgc cagcatcaag acagacagta ccttctctgg atttctcgtc tattctgact 840 

ggcacagctc cccagtcttc gcttaaaaca cagtgaaccc ggagctggca cttgctcctc 900 

agtggagggt gtgacactaa cccgcgcagc gcataccagg agggctggcc ccctggaata 960 

ttgtgaatga cttaggaaga gagggagcca cttccagtcc cactgctggc aatgaatgga 1020 

gacaggctgt ctgaggtcaa gacagcgtgg agcagtggct gggtttctgc ccaggacttt 1080 

agaatgcagt aggctggcag ctgtgggtcc tggcccagga ctccaaggtg ggatgctcca 1140 

ttcctagtcc tgtgtcccct ctaggtccct gactccatct ctgctgctcc cagggcaggc 1200 

ctttttctca gaggtcactt aataaaccta aaatcctcaa aaaaaaaaaa aaagggcggc 1260 

cgc . 1263 

<210> 126 
<211> 406 
<212> PRT 

<213> Mus musculus 



<400> 126 



Met 


Gly 


Pro 


Ser 


Ala 


Pro 


Leu 


Leu 


Leu 


Leu 


Phe 


Phe 


Leu 


Ser 


Trp 


Thr 


1 








5 










10 










15 




Gly 


Pro 


Leu 


Gin 
20 


Gly 


Gin 


Gin 


His 


His 
25 


Leu 


Val 


Glu 


Tyr 


Met 
30 


Glu 


Arg 


Arg 


Leu 


Ala 
35 


Ala 


Leu 


Glu 


Glu 


Arg 
40 


Leu 


Ala 


Gin 


Cys 


Gin 
45 


Asp 


Gin 


Ser 


Ser 


Arg 
50 


His 


Ala 


Ala 


Glu 


Leu 
55 


Arg 


Asp 


Phe 


Lys 


Asn 
60 


Lys 


Met 


Leu 


Pro 


Leu 


Leu 


Glu 


Val 


Ala 


Glu 


Lys 


Glu 


Arg 


Glu 


Thr 


Leu 


Arg 


Thr 


Glu 


Ala 


65 










70 










75 










80 


Asp 


Ser 


He 


Ser 


Gly 
85 


Arg 


Val 


Asp 


Arg 


He 
90 


Glu 


Arg 


Glu 


Val 


Asp 
95 


Tyr 


Leu 


Glu 


Thr 


Gin 


Asn 


Pro 


Ala 


Leu 


Pro 


Cy3 


Val 


Glu 


Leu 


Asp 


Glu 


Lys 








100 










105 










110 




Val 


Thr 


Gly 
115 


Gly 


Pro 


Gly 


Ala 


Lys 
120 


Gly 


Lys 


Gly 


Arg 


Arg 
125 


Asn 


Glu 


Lys 


Tyr 


ASp 
130 


Met 


Val 


Thr 


Asp 


Cys 
135 


Ser 


Tyr 


Thr 


Val 


Ala 
140 


Gin 


Val 


Arg 


Ser 


Met 


Lys 


He 


Leu 


Lys 


Arg 


Phe 


Gly 


Gly 


Ser 


Val 


Gly 


Leu 


Trp 


Thr 


Lys 
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145 


150 








155 160 


Asp Pro Leu 


Gly Pro Ala 


Glu 


Lys He 


Tyr 


Val Leu Asp Gly Thr Gin 




165 






170 


175 


Asn Asp Thr 


Ala Phe Val 


Phe 


Pro Arg 


Leu 


Arg Asp Phe Thr Leu Ala 




180 




185 




190 


Met Ala Ala 


Arg Lys Ala 


Ser 


Arg He 


Arg 


Val Pro Phe Pro Trp Val 


195 






200 




205 


Gly Thr Gly 


Gin Leu Val 


Tyr 


Gly Gly 


Phe 


Leu Tyr Tyr Ala Arg Arg 


210 




215 






220 


Pro Pro Gly 


Gly Pro Gly 


Gly 


Gly Gly 


Glu 


Leu Glu Asn Thr Leu Gin 


225 


230 








235 240 


Leu lie Lys 


Phe His Leu 


Ala 


Asn Arg 


Thr 


Val Val Asp Ser Ser Val 




245 






250 


255 


Phe Pro Ala 


Glu Ser Leu 


He 


Pro Pro 


Tyr 


Gly Leu Thr Ala Asp Thr 




260 




265 




270 


Tyr lie Asp 


Leu Ala Ala 


Asp 


Glu Glu 


Gly 


Leu Trp Ala Val Tyr Ala 


2 75 






280 




285 


Thr Arg Asp 


Asp Asp Arg 


His 


Leu Cys 


Leu 


Ala Lys Leu Asp Pro Gin 


290 




295 






300 


Thr Leu Asp 


Thr Glu Gin 


Gin 


Trp Asp 


Thr 


Pro Cys Pro Arg Glu Asn 


305 


310 








315 320 


Ala Glu Ala 


Ala Phe Val 


He 


Cys Gly 


Thr 


Leu Tyr Val Val Tyr Asn 




325 






330 


335 


Thr Arg Pro 


Ala Ser Arg 


Ala 


Arg He 


Gin 


Cys Ser Phe Asp Ala Ser 




340 




345 




350 


Gly Thr Leu 


Ala Pro Glu 


Arg 


Ala Ala 


Leu 


Ser Tyr Phe Pro Arg Arg 


355 






360 




365 


Tyr Gly Ala 


His Ala Ser 


Leu 


Arg Tyr 


Asn 


Pro Arg Glu Arg Gin Leu 


370 




375 






380 


Tyr Ala Trp 


Asp Asp Gly 


Tyr 


Gin lie 


Val 


Tyr Lys Leu Glu Met Lys 


385 


390 








395 400 


Lys Lys Glu 


Glu Glu Val 












405 











<210> 127 

<211> 1721 

<212> DNA 

<213> Mus musculus 

<400> 127 

gtcgacccac gcgtccgact taaggctgcc atggggccca gtgctcctct gctgctcctc 60 

ttctttttgt catggacggg accccttcag ggacagcagc accaccttgt ggagtacatg 120 

gaacgccgac tagctgcctt agaggaacgg ctggcccaat gccaggatca gagtagtcgg 180 

catgctgccg agcttcggga cttcaaaaac aagatgttgc ctctcctgga ggtggcagag 240 

aaggagcggg agaccctcag aactgaagca gactccatct caggaagagt ggaccgtatt 300 

gaaagggagg tagactatct ggagacacag aacccagctt tgccctgtgt agagctggat 360 

gagaaggtga ctggaggtcc tggagccaaa ggcaagggcc gaagaaatga gaaatacgat 420 

atggtgacgg actgtagcta cacagtcgct caggtgaggt caatgaagat cctgaagcgg 480 

tttggtggtt cagttggcct atggaccaag gatccgctgg ggccagcaga gaagatctac 540 

gtgttagacg gcacccagaa cgacacggct tttgtcttcc caaggctgcg tgacttcacc 600 

cttgccatgg ctgcccggaa agcttcccga attcgggtgc ccttcccctg ggtaggcacg 660 

gggcagctgg tgtacggtgg cttcctttat tatgctcgaa ggcctcctgg aggacctgga 720 

gggggtggtg aattggagaa cactctgcag ctgatcaaat ttcacttggc aaaccgaaca 780 

gtggtggata gctcagtgtt ccctgcagag agcctgatac ccccctacgg cctgacagca 84 0 

gatacatata tcgacctggc agctgatgag gagggcctgt gggctgtcta tgccactcga 900 

gatgatgaca ggcatttgtg tctagccaag ttagacccac agacacttga cacagagcag 960 

cagtgggaca caccatgtcc cagagagaac gcagaggctg cgtttgtcat ctgtgggacc 1020 

ctgtacgttg tctataacac ccgccctgcc agtagggctc gtattcagtg ttccttcgat 1080 
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gccagtggta ctctcgcccc tgaaagggca gcactctcct attttccacg ccgatatggt 1140 

gcccatgcca gccttcgcta taacccccgt gagcgccagc tgtatgcctg ggatgatggc 1200 

taccagattg tctacaaatt ggagatgaag aagaaggagg aggaagttta agcagctagc 1260 

cttgtgctct tgattcttat gcccagacat ttatattcct gtgagctctc ctgcagttca 1320 

tccttcaaaa cgaaggccag tggtggtagc tcatataccc taatttctaa aggacaacca 1380 

aattctcaag cccctctgtt ttatgcagaa ctccagatcc tgggtagcat tttagaactg 1440 

aacagcaaac aaacacccta aatcttcact cctgccttat gtccacaaag tttagttcca 1500 

aactcagagc cctgtccttt ggagagggtc aaccccagac agcaggcgac agcattcttg 1560 

ccctcagtat gaccgaaggg agagaactca gagacaaagc tgccctccct cccttccccc 1620 

tccagtgtag gggagaatgg ggctttcccc acatcacttt gtatggtaac agtttgcatt 1680 

aaaaggaaaa cccaccaaaa aaaaaaaaaa agggcggccg c 1721 

<210> 128 
<211> 406 
<212> PRT 
<213> Mus musculus 



<400> 128 
Met Gly Pro 
1 

Gly Pro Leu 

Arg Leu Ala 
35 

Ser Arg His 
50 

Leu Leu Glu 
65 

Asp Ser lie 

Leu Glu Thr 

Val Thr Gly 
115 

Tyr Asp lie 
130 

Met Lys He 
145 

Asp Pro Leu 

Asn Asp Thr 

Met Ala Ala 
195 

Gly Thr Gly 

210 
Pro Pro Gly 
225 

Leu He Lys 

Phe Pro Ala 

Tyr He Asp 
275 

Thr Arg Asp 

290 
Thr Leu Asp 
305 



Ser Ala 
5 

Gin Gly 
20 

Ala Leu 

Ala Ala 

Val Ala 

Ser Gly 
85 

Gin Asn 
100 

Gly Pro 

Val Thr 

Leu Lys 

Gly Pro 
165 
Ala Phe 
180 

Arg Lys 

Gin Leu 

Gly Pro 

Phe His 
245 
Glu Ser 
260 

Leu Ala 
Asp Asp 
Thr Glu 



Pro Leu Leu Leu 

Gin Gin His His 
25 

Glu Glu Arg Leu 
40 

Glu Leu Arg Asp 
55 

Glu Lys Glu Arg 
70 

Arg Val Asp Arg 

Pro Ala Leu Pro 
105 

Gly Ala Lys Gly 
120 

Asp Cys Ser Tyr 
135 

Arg Phe Gly Gly 
150 

Ala Glu Lys He 

Val Phe Pro Arg 
185 

Ala Ser Arg He 
200 

Val Tyr Gly Gly 
215 

Gly Gly Gly Gly 
230 

Leu Ala Asn Arg 

Leu He Pro Pro 
265 

Ala Asp Glu Glu 
280 

Arg His Leu Cys 
295 

Gin Gin Trp Asp 
310 



Leu Phe Phe Leu Ser Trp Thr 
10 15 
Leu Val Glu Tyr Met Glu Arg 
30 

Ala Gin Cys Gin Asp Gin Ser 
45 

Phe Lys Asn Lys Met Leu Pro 
60 

Glu Thr Leu Arg Thr Glu Ala 

75 80 
Leu Glu Arg Glu Val Asp Tyr 
90 95 
Cys Val Glu Leu Asp Glu Lys 
110 

Lys Gly Arg Arg Asn Glu Lys 
125 

Thr Val Ala Gin Val Arg Ser 
140 

Ser Val Gly Leu Trp Thr Lys 
155 160 
Tyr Val Leu Asp Gly Thr Gin 
170 175 
Leu Arg Asp Phe Thr Leu Ala 
190 

Arg Val Pro Phe Pro Trp Val 
205 

Phe Leu Tyr Tyr Ala Arg Arg 
220 

Glu Leu Glu Asn Thr Leu Gin 
235 240 
Thr Val Val Asp Ser Ser Val 
250 255 
Tyr Gly Leu Thr Ala Asp Thr 
270 

Gly Leu Trp Ala Val Tyr Ala 
285 

Leu Ala Lys Leu Asp Pro Gin 
300 

Thr Pro Cys Pro Arg Glu Asn 
315 320 
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Ala Glu Ala Ala 


Phe 
325 


Val 


lie Cys 


Thr Arg Pro Ala 


Ser 


Arg 


Ala Arg 


340 








Gly Thr Leu Ala 


Pro 


Glu 


Arg Ala 


355 






360 


Tyr Gly Ala His 


Ala 


Ser 


Leu Arg 


370 






375 


Tyr Ala Trp Asp 


Asp 


Gly 


Tyr Gin 


385 




390 




Lys Lys Glu Glu 


Glu 
405 


Val 





<210> 129 

<211> 1721 

<212> DNA 

<213> Mus musculus 



Gly Thr Leu Tyr Val Val Tyr Asn 

330 335 
lie Gin Cys Ser Phe Asp Ala Ser 
345 350 
Ala Leu Ser Tyr Phe Pro Arg Arg 
365 

Tyr Asn Pro Arg Glu Arg Gin Leu 
380 

He Val Tyr Lys Leu Glu Met Lys 
395 400 



<400> 129 

gtcgacccac gcgtccgact taaggctgcc atggggccca gtgctcctct gctgctcctc 60 

ttctttttgt catggacggg accccttcag ggacagcagc accaccttgt ggagtacatg 120 

gaacgccgac tagctgcctt agaggaacgg ctggcccaat gccaggatca gagtagtcgg 180 

catgctgccg agcttcggga cttcaaaaac aagatgttgc ctctcctgga ggtggcagag 24 0 

aaggagcggg agaccctcag aactgaagca gactccatct caggaagagt ggaccgtctt 300 

gaaagggagg tagactatct ggagacacag aacccagctt tgccctgtgt agagctggat 360 

gagaaggtga ctggaggtcc tggagccaaa ggcaagggcc gaagaaatga gaaatacgat 420 

atagtgacgg actgtagcta cacagtcgct caggtgaggt caatgaagat cctgaagcgg 480 

tttggtggtt cagttggcct atggaccaag gatccgctgg ggccagcaga gaagatctac 540 

gtgttagacg gcacccagaa cgacacggct tttgtcttcc caaggctgcg tgacttcacc 600 

cttgccatgg ctgcccggaa agcttcccga attcgggtgc ccttcccctg ggtaggcacg 660 

gggcagctgg tgtacggtgg cttcctttat tatgctcgaa ggcctcctgg aggacctgga 72 0 

gggggtggtg aattggagaa cactctgcag ctgatcaaat ttcacttggc aaaccgaaca 780 

gtggtggata gctcagtgtt ccctgcagag agcctgatac ccccctacgg cctgacagca 840 

gatacatata tcgacctggc agctgatgag gagggcctgt gggctgtcta tgccactcga 900 

gatgatgaca ggcatttgtg tctagccaag ttagacccac agacacttga cacagagcag 960 

cagtgggaca caccatgtcc cagagagaac gcagaggctg cgtttgtcat ctgtgggacc 1020 

ctgtacgttg tctataacac ccgccctgcc agtagggctc gtattcagtg ttccttcgat 1080 

gccagtggta ctctcgcccc tgaaagggca gcactctcct attttccacg ccgatatggt 1140 

gcccatgcca gccttcgcta taacccccgt gagcgccagc tgtatgcctg ggatgatggc 1200 

taccagattg tctacaaatt ggagatgaag aagaaggagg aggaagttta agcagctagc 1260 

cttgtgctct tgattcttat gcccagacat ttatattcct gtgagctctc ctgcagttca 1320 

tccttcaaaa cgaaggccag tggtggtagc tcatataccc taatttctaa aggacaacca 1380 

aattctcaag cccctctgtt ttatgcagaa ctccagatcc tgggtagcat tttagaactg 1440 

aacagcaaac aaacacccta aatcttcact cctgccttat gtccacaaag tttagttcca 1500 

aactcagagc cctgtccttt ggagagggtc aaccccagac agcaggcgac agcattcttg 1560 

ccctcagtat gaccgaaggg agagaactca gagacaaagc tgccctccct cccttccccc 1620 

tccagtgtag gggagaatgg ggctttcccc acatcacttt gtatggtaac agtttgcatt 1680 

aaaaggaaaa cccaccaaaa aaaaaaaaaa agggcggccg c 1721 

<210> 130 
<211> 406 
<212> PRT 
<213> Mus musculus 

<400> 130 

Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr 

1 5 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 
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20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 

35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 

50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Ser He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 

85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys 

100 105 110 

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys 

115 120 125 

Tyr Asp Met Val Thr Asp Cys Ser Tyr Thr Val Ala Gin Val Arg Ser 

130 135 140 

Met Lys lie Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Pro Ala Glu Lys He Tyr Ala Leu Asp Gly Thr Gin 

165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala. 

180 185 ~ 190 

Met Ala Ala Arg Lys Ala Ser Arg He Arg Val Pro Phe Pro Trp Val 

195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg 

210 215 220 

Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gin 
225 230 235 240 

Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 

245 250 255 

Phe Pro Ala Glu Ser Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 

260 265 270 

Tyr He Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 

275 280 285 

Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 

290 295 300 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 310 315 ~ 320 

Ala Glu Ala Ala Phe Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn 

325 330 335 

Thr Arg Pro Ala Ser Arg Ala Arg lie Gin Cys Ser Phe Asp Ala Ser 

340 345 350 

Gly Thr Leu Ala Pro Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg 

355 360 365 

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu 

370 375 380 

Tyr Ala Trp Asp Asp Gly Tyr Gin He Val Tyr Lys Leu Glu Met Lys 
385 390 395 400 

Lys Lys Glu Glu Glu Val 
405 



<210> 131 

<211> 1721 

<212> DNA 

<213> Mus musculus 



<400> 131 

gtcgacccac gcgtccgact taaggctgcc atggggccca gtgctcctct gctgctcctc 60 
ttctttttgt catggacggg accccttcag ggacagcagc accaccttgt ggagtacatg 120 
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gaacgccgac tagctgcctt agaggaacgg ctggcccaat gccaggatca gagtagtcgg 180 

catgctgccg agcttcggga cttcaaaaac aagatgttgc ctctcctgga ggtggcagag 24 0 

aaggagcggg agaccctcag aactgaagca gactccatct caggaagagt ggaccgtctt 300 

gaaagggagg tagactatct ggagacacag aacccagctt tgccctgtgt agagctggat 360 

gagaaggtga ctggaggtcc tggagccaaa ggcaagggcc gaagaaatga gaaatacgat 420 

at 99tgacgg actgtagcta cacagtcgct caggtgaggt caatgaagat cctgaagcgg 4 80 

tttggtggtt cagttggcct atggaccaag gatccgctgg ggccagcaga gaagatctac 540 

gcgttagacg gcacccagaa cgacacggct tttgtcttcc caaggctgcg tgacttcacc 600 

cttgccatgg ctgcccggaa agcttcccga attcgggtgc ccttcccctg ggtaggcacg 660 

gggcagctgg tgtacggtgg cttcctttat tatgctcgaa ggcctcctgg aggacctgga 720 

gggggtggtg aattggagaa cactctgcag ctgatcaaat ttcacttggc aaaccgaaca 780 

gtggtggata gctcagtgtt ccctgcagag agcctgatac ccccctacgg cctgacagca 840 

gatacatata tcgacctggc agctgatgag gagggcctgt gggctgtcta tgccactcga 900 

gatgatgaca ggcatttgtg tctagccaag ttagacccac agacacttga cacagagcag 960 

cagtgggaca caccatgtcc cagagagaac gcagaggctg cgtttgtcat ctgtgggacc 1020 

ctgtacgttg tctataacac ccgccctgcc agtagggctc gtattcagtg ttccttcgat 1080 

gccagtggta ctctcgcccc tgaaagggca gcactctcct attttccacg ccgatatggt 1140 

gcccatgcca gccttcgcta taacccccgt gagcgccagc tgtatgcctg ggatgatggc 1200 

taccagattg tctacaaatt ggagatgaag aagaaggagg aggaagttta agcagctagc 1260 

cttgtgctct tgattcttat gcccagacat ttatattcct gtgagctctc ctgcagttca 1320 

tccttcaaaa cgaaggccag tggtggtagc tcatataccc taatttctaa aggacaacca 1380 

aattctcaag cccctctgtt ttatgcagaa ctccagatcc tgggtagcat tttagaactg 1440 

aacagcaaac aaacacccta aatcttcact cctgccttat gtccacaaag tttagttcca 1500 

aactcagagc cctgtccttt ggagagggtc aaccccagac agcaggcgac agcattcttg 1560 

ccctcagtat gaccgaaggg agagaactca gagacaaagc tgccctccct cccttccccc 1620 

tccagtgtag gggagaatgg ggctttcccc acatcacttt gtatggtaac agtttgcatt 1680 

aaaaggaaaa cccaccaaaa aaaaaaaaaa agggcggccg c 1721 

<210> 132 
<211> 406 
<212> PRT 

<213> Mus musculus 
<400> 132 



Met 


Gly 


Pro 


Ser 


Ala 


Pro 


Leu 


Leu 


Leu 


Leu Phe Phe Leu Ser Trp Thr 


1 








5 










10 15 


Gly 


Pro 


Leu 


Gin 


Gly 


Gin 


Gin 


His 


His 


Leu Val Glu Tyr Met Glu Arg 








20 










25 


30 


Arg 


Leu 


Ala 


Ala 


Leu 


Glu 


Glu 


Arg 


Leu 


Ala Gin Cys Gin Asp Gin Ser 






35 










40 




45 


Ser 


Arg 


His 


Ala 


Ala 


Glu 


Leu 


Arg 


Asp 


Phe Lys Asn Lys Met Leu Pro 




50 










55 






60 


Leu 


Leu 


Glu 


Val 


Ala 


Glu 


Lys 


Glu 


Arg 


Glu Thr Leu Arg Thr Glu Ala 


65 










70 








75 80 


Asp 


Ser 


He 


Ser 


Gly 


Arg 


Val 


Asp 


Arg 


Leu Glu Arg Glu Val Asp Tyr 










85 










90 95 


Leu 


Glii 


Thr 


Gin 


Asn 


Pro 


Ala 


Leu 


Pro 


Cys Val Glu Leu Asp Glu Lys 








100 










105 


110 


Val 


Thr 


Gly 


Gly 


Pro 


Gly 


Ala 


Lys 


Gly 


Lys Gly Arg Arg Asn Glu Lys 






115 










120 




125 


Tyr 


Asp 


Met 


Val 


Thr 


Asp 


Cys 


Ser 


Tyr 


Thr Val Ala Gin Val Arg Ser 




130 










135 






140 


Met 


Lys 


He 


Leu 


Lys 


Arg 


Phe 


Gly 


Gly 


Ser Val Gly Leu Trp Thr Lys 


145 










150 








155 160 


Asp 


Pro 


Leu 


Gly 


Pro 


Ala 


Glu 


Lys 


He 


Tyr Val Leu Asp Gly Thr Gin 










165 










170 175 


Asn 


Asp 


Thr 


Ala 


Phe 


Val 


Phe 


Pro 


Arg 


Leu Arg Asp Phe Thr Leu Val 








180 










185 


190 
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Met Ala 

Gly Thr 
210 
Pro Pro 
225 

Leu lie 

Phe Pro 

Tyr lie 

Thr Arg 
290 
Thr Leu 
305 

Ala Glu 

Thr Arg 

Gly Thr 

Tyr Gly 
370 
Tyr Ala 
385 

Lys Lys 



Ala Arg 
195 

Gly Gin 

Gly Gly 

Lys Phe 

Ala Glu 
260 
Asp Leu 
275 

Asp Asp 

Asp Thr 

Ala Ala 

Pro Ala 
340 
Leu Ala 
355 

Ala His 
Trp Asp 
Glu Glu 



Lys Ala 

Leu Val 

Pro Gly 
230 
His Leu 
245 

Ser Leu 

Ala Ala 

Asp Arg 

Glu Gin 
310 
Phe Val 
325 

Ser Arg 

Pro Glu 

Ala Ser 

Asp Gly 
390 
Glu Val 
405 



Ser Arg 
200 
Tyr Gly 
215 

Gly Gly 

Ala Asn 

He Pro 

Asp Glu 
280 
His Leu 
295 

Gin Trp 

He Cys 

Ala Arg 

Arg Ala 
360 
Leu Arg 
375 

Tyr Gin 



He Arg Val Pro 
Gly Phe 
Gly Glu 



Arg Thr 
250 
Pro Tyr 
265 

Glu Gly 

Cys Leu 

Asp Thr 

Gly Thr 
330 
He Gin 
345 

Ala Leu 
Tyr Asn 
lie Val 



Leu Tyr 
220 
Leu Glu 
235 

Val Val 



Gly Leu 

Leu Trp 

Ala Lys 
300 
Pro Cys 
315 

Leu Tyr 

Cys Ser 

Ser Tyr 

Pro Arg 
380 
Tyr Lys 
395 



Phe Pro 
205 

Tyr Ala 

Asn Thr 

Asp Ser 

Thr Ala 
270 
Ala Val 
285 

Leu Asp 

Pro Arg 

Val Val 

Phe Asp 
350 
Phe Pro 
365 

Glu Arg 
Leu Glu 



Trp Val 

Arg Arg 

Leu Gin 
240 
Ser Val 
255 

Asp Thr 

Tyr Ala 

Pro Gin 

Glu Asn 
320 
Tyr Asn 
335 

Ala Ser 

Arg Arg 

Gin Leu 

Met Lys 
400 



<210> 133 
<211> 1721 
<212> DNA 

<213> Mus musculus 
<400> 133 

gtcgacccac gcgtccgact taaggctgcc atggggccca gtgctcctct gctgctcctc 60 

ttctttttgt catggacggg accccttcag ggacagcagc accaccttgt ggagtacatg 120 

gaacgccgac tagctgcctt agaggaacgg ctggcccaat gccaggatca gagtagtcgg 180 

catgctgccg agcttcggga cttcaaaaac aagatgttgc ctctcctgga ggtggcagag 24 0 

aaggagcggg agaccctcag aactgaagca gactccatct caggaagagt ggaccgtctt 300 

gaaagggagg tagactatct ggagacacag aacccagctt tgccctgtgt agagctggat 360 

gagaaggtga ctggaggtcc tggagccaaa ggcaagggcc gaagaaatga gaaatacgat 420 

atggtgacgg actgtagcta cacagtcgct caggtgaggt caatgaagat cctgaagcgg 480 

tttggtggtt cagttggcct atggaccaag gatccgctgg ggccagcaga gaagatctac 540 

gtgttagacg gcacccagaa cgacacggct tttgtcttcc caaggctgcg tgacttcacc 600 

cttgtcatgg ctgcccggaa agcttcccga attcgggtgc ccttcccctg ggtaggcacg 660 

gggcagctgg tgtacggtgg cttcctttat tatgctcgaa ggcctcctgg aggacctgga 720 

999ggtggtg aattggagaa cactctgcag ctgatcaaat ttcacttggc aaaccgaaca 780 

gtggtggata gctcagtgtt ccctgcagag agcctgatac ccccctacgg cctgacagca 840 

gatacatata tcgacctggc agctgatgag gagggcctgt gggctgtcta tgccactcga 900 

gatgatgaca ggcatttgtg tctagccaag ttagacccac agacacttga cacagagcag 960 

cagtgggaca caccatgtcc cagagagaac gcagaggctg cgtttgtcat ctgtgggacc 1020 

ctgtacgttg tctataacac ccgccctgcc agtagggctc gtattcagtg ttccttcgat 1080 

gccagtggta ctctcgcccc tgaaagggca gcactctcct attttccacg ccgatatggt 1140 

gcccatgcca gccttcgcta taacccccgt gagcgccagc tgtatgcctg ggatgatggc 1200 

taccagattg tctacaaatt ggagatgaag aagaaggagg aggaagttta agcagctagc 1260 

cttgtgctct tgattcttat gcccagacat ttatattcct gtgagctctc ctgcagttca 1320 

tccttcaaaa cgaaggccag tggtggtagc tcatataccc taatttctaa aggacaacca 1380 
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aattctcaag cccctctgtt ttatgcagaa ctccagatcc tgggtagcat tttagaactg 1440 

aacagcaaac aaacacccta aatcttcact cctgccttat gtccacaaag tttagttcca 1500 

aactcagagc cctgtccttt ggagagggtc aaccccagac agcaggcgac agcattcttg 1560 

ccctcagtat gaccgaaggg agagaactca gagacaaagc tgccctccct cccttccccc 1620 

tccagtgtag gggagaatgg ggctttcccc acatcacttt gtatggtaac agtttgcatt 1680 

aaaaggaaaa cccaccaaaa aaaaaaaaaa agggcggccg c 1721 

<210> 134 
<211> 370 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid 



<400> 134 




















Met He 


Ser 


Leu 


Pro 


Gly 


Pro 


Leu 


Val 


Thr 


Asn Leu Xaa Arg Phe Leu 


1 






5 










10 




15 


Phe Leu 


Gly 


Leu 


Ser 


Ala 


Leu 


Ala 


Pro 


Pro 


Ser Arg Ala Gin Leu Gin 






20 










25 




30 




Leu His 


Leu 


Pro 


Ala 


Asn 


Arg 


Leu 


Gin 


Ala 


Val Glu Glu Gly Glu Ser 




35 










40 






45 




Gly Ala 


Ser 


Ala 


Trp 


Tyr 


Thr 


Leu 


His 


Arg 


Glu Ala Ser Ser 


Ser Gin 


50 










55 








60 




Pro Trp 


Glu 


Val 


Pro 


Phe 


Val 


Met 


Trp 


Phe 


Phe Lys Gin Lys 


Glu Lys 


65 








70 










75 


60 


Glu Asp 


Gin 


Val 


Leu 


Ser 


Tyr 


He 


Asn 


Gly 


Val Thr Thr Ser 


Lys Pro 








85 










90 




95 


Gly Val 


Ser 


Leu 


Val 


Tyr 


Ser 


Met 


Pro 


Ser 


Arg Asn Leu Ser 


Leu Arg 






100 










105 




110 




Val Glu 


Gly 


Leu 


Gin 


Glu 


Lys 


Asp 


Ser 


Gly 


Pro Tyr Ser Cys 


Ser Val 




115 










120 






125 




Asn Val 


Gin 


Asp 


Lys 


Gin 


Gly 


Lys 


Ser 


Ara 


Gly His Ser He 


Lys Thr 


130 










135 








140 




Leu Glu 


Leu 


Asn 


Val 


Leu 


Val 


Pro 


Pro 


Ala 


Pro Pro Ser Cys 


Arg Leu 


145 








150 










155 


160 


Gin Gly 


Val 


Pro 


His 


Val 


Gly 


Ala 


Asn 


Val 


Thr Leu Ser Cys 


Gin Ser 








165 










170 




175 


Pro Arg 


Ser 


Lys 


Pro 


Ala 


Val 


Gin 


Tyr 


Gin 


Trp Asp Arg Gin Leu Pro 






180 










185 




190 




Ser Phe 


Gin 


Thr 


Phe 


Phe 


Ala 


Pro 


Ala 


Leu 


Asp Val He Arg Gly Ser 




195 










200 






205 




Leu Ser 


Leu 


Thr 


Asn 


Leu 


Ser 


Ser 


Ser 


Met 


Ala Gly Val Tyr Val Cys 


210 










215 








220 




Lys Ala 


His 


Asn 


Glu 


Val 


Gly 


Thr 


Ala 


Gin 


Cys Asn Val Thr 


Leu Glu 


225 








230 










235 


240 


Val Ser 


Thr 


Gly 


Pro 


Gly 


Ala 


Ala 


Val 


Val 


Ala Glu Ala Val Val Gly 








245 










250 




255 


Thr Leu 


Val 


Gly 


Leu 


Gly 


Leu 


Leu 


Ala 


Gly 


Leu Val Leu Leu Tyr His 






260 










265 




270 




Arg Arg 


Gly 


Lys 


Ala 


Leu 


Glu 


Glu 


Pro 


Ala 


Asn Asp He Lys Glu Asp 




275 










280 






285 




Ala He 


Ala 


Pro 


Arg 


Thr 


Leu 


Pro 


Trp 


Pro 


Lys Ser Ser Asp 


Thr He 


290 










295 








300 




Ser Lys 


Asn 


Gly 


Thr 


Leu 


Ser 


Ser 


Val 


Thr 


Ser Ala Arg Ala Leu Arg 


305 








310 










315 


320 
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Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro Thr Pro Ser 

325 330 335 

Leu Ser Ser Gin Ala Leu Pro Ser Pro Arg His Ala His Asp Arg Trp 

340 345 350 

Gly Pro Pro Ser Thr Asn He Pro His Pro Trp Trp Gly Phe Phe Leu 
355 360 365 

Trp Leu 
370 

<210> 135 

<211> 1869 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modif iedMbase 
<222> all M n M positions 
<223> n=a, c, g, or t 

<400> 135 

gtcgacccac gcgtncntcc agcgtncgga gccgccctgg gtgtcagcgg ctcggctccc 60 

gcgcacgctc cggccgtcgc gcagcctcgg cacctgcagg tccgtgcgtc ccgcggctgg 120 

cgcccctgac tccgtcccgg ccagggaggg ccatgatttc cctcccgggg cccctggtga 180 

ccaacttgnt gcggtttttg ttcctggggc tgagtgccct cgcgcccccc tcgcgggccc 240 

agctgcaact gcacttgccc gccaaccggt tgcaggcggt ggaggagggg gaaagtggtg 300 

cttcagcatg gtacaccttg cacagggagg cgtcttcatc ccagccatgg gaggtgccct 360 

ttgtgatgtg gttcttcaaa cagaaagaaa aggaggatca ggtgttgtcc tacatcaatg 420 

gggtcacaac aagcaaacct ggagtatcct tggtctactc catgccctcc cggaacctgt 480 

ccctgcgggt ggagggtctc caggagaaag actctggccc ctacagctgc tccgtgaatg 540 

tgcaagacaa acaaggcaaa tctaggggcc acagcatcaa aaccttagaa ctcaatgtac 600 

tggttcctcc agctcctcca tcctgccgtc tccagggtgt gccccatgtg ggggcaaacg 660 

tgaccctgag ctgccagtct ccaaggagta agcccgctgt ccaataccag tgggatcggc 720 

agcttccatc cttccagact ttctttgcac cagcattaga tgtcatccgt gggtctttaa 780 

gcctcaccaa cctttcgtct tccatggctg gagtctatgt ctgcaaggcc cacaatgagg 840 

tgggcactgc ccaatgtaat gtgacgctgg aagtgagcac agggcctgga gctgcagtgg 900 

ttgctgaagc tgttgtgggt accctggttg gactggggtt gctggctggg ctggtcctct 960 

tgtaccaccg ccggggcaag gccctggagg agccagccaa tgatatcaag gaggatgcca 1020 

ttgctccccg gaccctgccc tggcccaaga gctcagacac aatctccaag aatgggaccc 1080 

tttcctctgt cacctccgca cgagccctcc ggccacccca tggccctccc aggcctggtg 1140 

cattgacccc cacgcccagt ctatccagcc aggccctgcc ctcaccaaga catgcccacg 1200 

acagatgggg cccaccctca accaatatcc cccatccctg gtggggtttt ttcctttggc 1260 

tttgagccgc atgggtgctg ngcctgtgat ggngcctgcc cagagtcaag ctggctctct 1320 

ggtatgatga ccccaccact cattggctaa aggatttggg gtctctcctt cctataaggg 1380 

tcacctctag cacagaggcc tgagtcatgg gaaagagtca cactcctgac ccttagtact 1440 

ctgcccccac ctctctttac tgtgggaaaa ccatctcagt aagacctaag tgtccaggag 1500 

acagaaggag aagaggaagt ggatctggaa ttgggaggag cctccaccca cccctgactc 1560 

ctccttatga agccagctgc tgaaattagc tactcaccaa gagtgagggg cagagacttc 1620 

cagtcactga gtctcccagg cccccttgat ctgtacccca cccctatcta acaccaccct 1680 

tggctcccac tccagctccc tgtattgata taacctgtca ggctggcttg gttaggtttt 1740 

actggggcag aggataggga atctcttatt aaaactaaca tgaaatatgt gttgttttca 1800 

tttgcaaatt taaataaaga tacataatgt ttgtatgaga taagaaaaaa aaaaaaaaag 1860 

ggcggccgc ^ 1869 

<210> 136 
<211> 370 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid 



<400> 136 

Met He Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu 

1 5 10 15 

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala Gin Leu Gin 

20 25 30 

Leu His Leu Pro Ala Asn Arg Leu Gin Ala Val Glu Glu Gly Glu Ser 

35 40 45 

Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gin 

50 55 60 

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gin Lys Glu Lys 
65 70 75 80 

Glu Asp Gin Val Leu Ser Tyr He Asn Gly Val Thr Thr Ser Lys Pro 

85 90 95 

Gly Val Ser Leu Ala Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg 

100 105 110 

Val Glu Gly Leu Gin Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val 

115 120 125 

Asn Val Gin Asp Lys Gin Gly Lys Ser Arg Gly His Ser He Lys Thr 

130 135 140 

Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Leu 
"5 150 155 * 160 

Gin Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gin Ser 

165 170 175 

Pro Arg Ser Lys Pro Ala Val Gin Tyr Gin Trp Asp Arg Gin Leu Pro 

180 ^ 185 ' " 190 

Ser Phe Gin Thr Phe Phe Ala Pro Ala Leu Asp Val He Arg Gly Ser 

195 200 205 

Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr Val Cys 

210 215 22 0 

Lys Ala His Asn Glu Val Gly Thr Ala Gin Cys Asn Val Thr Leu Glu 
225 230 235 240 

Val Ser Thr Gly Pro Gly Ala Ala Val Val Ala Glu Ala Val Val Gly 

245 250 255 

Thr Leu Val Gly Leu Gly Leu Leu Ala Gly Leu Val Leu Leu Tyr His 

260 265 270 

Arg Arg Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp He Lys Glu Asp 

275 280 285 

Ala lie Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser Asp Thr He 

290 295 ~ 300 

Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu Arg 
305 310 315 ' 320 

Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro Thr Pro Ser 

325 330 335 

Leu Ser Ser Gin Ala Leu Pro Ser Pro Arg His Ala His Asp Arg Trp 

340 345 350 

Gly Pro. Pro Ser Thr Asn He Pro His Pro Trp Trp Gly Phe Phe Leu 
355 360 365 

Trp Leu 
370 



<210> 137 
<211> 1869 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> modif ied_base 
<222> all f, n M positions 
<223> n=a, c, g, or t 

<400> 137 

gtcgacccac gcgtncntcc agcgtncgga gccgccctgg gtgtcagcgg ctcggctccc 60 

gcgcacgctc cggccgtcgc gcagcctcgg cacctgcagg tccgtgcgtc ccgcggctgg 120 

cgcccctgac tccgtcccgg ccagggaggg ccatgatttc cctcccgggg cccctggtga 180 

ccaacttgnt gcggtttttg ttcctggggc tgagtgccct cgcgcccccc tcgcgggccc 240 

agctgcaact gcacttgccc gccaaccggt tgcaggcggt ggaggagggg gaaagtggtg 300 

cttcagcatg gtacaccttg cacagggagg tgtcttcatc ccagccatgg gaggtgccct 360 

ttgtgatgtg gttcttcaaa cagaaagaaa aggaggatca ggtgttgtcc tacatcaatg 420 

gggtcacaac aagcaaacct ggagtatcct tggcctactc catgccctcc cggaacctgt 4 80 

ccctgcgggt ggagggtctc caggagaaag actctggccc ctacagctgc tccgtgaatg 540 

tgcaagacaa acaaggcaaa tctaggggcc acagcatcaa aaccttagaa ctcaatgtac 600 

tggttcctcc agctcctcca tcctgccgtc tccagggtgt gccccatgtg ggggcaaacg 660 

tgaccctgag ctgccagtct ccaaggagta agcccgctgt ccaataccag tgggatcggc 720 

agcttccatc cttccagact ttctttgcac cagcattaga tgtcatccgt gggtctttaa 780 

gcctcaccaa cctttcgtct tccatggctg gagtctatgt ctgcaaggcc cacaatgagg 840 

tgggcactgc ccaatgtaat gtgacgctgg aagtgagcac agggcctgga gctgcagtgg 900 

ttgctgaagc tgttgtgggt accctggttg gactggggtt gctggctggg ctggtcctct 960 

tgtaccaccg ccggggcaag gccctggagg agccagccaa tgatatcaag gaggatgcca 1020 

ttgctccccg gaccctgccc tggcccaaga gctcagacac aatctccaag aatgggaccc 1080 

tttcctctgt cacctccgca cgagccctcc ggccacccca tggccctccc aggcctggtg 1140 

cattgacccc cacgcccagt ctatccagcc aggccctgcc ctcaccaaga catgcccacg 1200 

acagatgggg cccaccctca accaatatcc cccatccctg gtggggtttt ttcctttggc 1260 

tttgagccgc atgggtgctg ngcctgtgat ggngcctgcc cagagtcaag ctggctctct 1320 

ggtatgatga ccccaccact cattggctaa aggatttggg gtctctcctt cctataaggg 1380 

tcacctctag cacagaggcc tgagtcatgg gaaagagtca cactcctgac ccttagtact 1440 

ctgcccccac ctctctttac tgtgggaaaa ccatctcagt aagacctaag tgtccaggag 1500 

acagaaggag aagaggaagt ggatctggaa ttgggaggag cctccaccca cccctgactc 1560 

ctccttatga agccagctgc tgaaattagc tactcaccaa gagtgagggg cagagacttc 1620 

cagtcactga gtctcccagg cccccttgat ctgtacccca cccctatcta acaccaccct 1680 

tggctcccac tccagctccc tgtattgata taacctgtca ggctggcttg gttaggtttt 1740 
actggggcag aggataggga atctcttatt aaaactaaca tgaaatatgt gttgttttca ' 1800 

tttgcaaatt taaataaaga tacataatgt ttgtatgaga taagaaaaaa aaaaaaaaag 1860 

ggcggccgc 1869 

<210> 138 
<211> 370 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid 



<400> 138 
Met lie Ser Leu Pro Gly Pro Leu 

1 5 
Phe Leu Gly Leu Ser Ala Leu Ala 
20 



Val Thr Asn Leu Xaa Arg Phe Leu 

* 10 15 
Pro Pro Ser Arg Ala Gin Leu Gin 
25 30 
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Leu 


His 


Leu 


Pro 


Ala Asn Arg 


Leu Gin Ala 


Val Glu 


Glu 


Gly 


Glu 


Ser 






35 










40 




45 








Gly 


Ala 


Ser 


Ala 


Trp Tyr Thr 


Leu His Arg 


Glu Val 


Ser 


Ser 


Ser 


Gin 




50 










55 




60 










Pro 


Trp 


Glu 


Val 


Pro 


Phe 


Val 


Met Trp Phe 


Phe Lys 


Gin 


Lys 


Glu 


Lys 


65 










70 






75 








80 


Glu 


Asp 


Gin 


Val 


Leu Ser Tyr 


He Asn Gly 


Val Thr 


Thr 


Ser 


Lys 


Pro 










85 






90 








95 




Gly 


Val 


Ser 


Leu 


Val 


Tyr 


Ser 


Met Pro Ser 


Arg Asn 


Leu 


Ser 


Leu 


Arg 








100 








105 






110 






Val 


Glu 


Gly 


Leu 


Gin Glu Lys 


Asp Ser Gly 


Pro Tyr 


Ser 


Cys 


Ser 


Val 






115 










120 




125 








Asn 


Val 


Gin 


Asp 


Lys 


Gin Gly 


Lys Ser Arg 


Gly His 


Ser 


He 


Lys 


Thr 




130 










135 




140 










Leu 


Glu 


Leu 


Asn 


Val 


Leu 


Val 


Pro Pro Ala 


Pro Pro 


Ser 


Cys 


Arg 


He 


145 










150 






155 






160 


Gin 


Gly 


Val 


Pro 


His 


Val 


Gly 


Ala Asn Val 


Thr Leu 


Ser 


Cys 


Gin 


Ser 










165 






170 








175 




Pro 


Arg 


Ser 


Lys 


Pro 


Ala 


Val 


Gin Tyr Gin 


Trp Asp Arg 


Gin 


Leu 


Pro 








180 








185 






190 






Ser 


Phe 


Gin 


Thr 


Phe 


Phe 


Ala 


Pro Ala Leu 


Asp Val 


He 


Arg 


Gly 


Ser 






195 










200 




205 








Leu 


Ser 


Leu 


Thr 


Asn 


Leu 


Ser 


Ser Ser Met 


Ala Gly Val 


Tyr 


Val 


Cys 




210 










215 




220 








Lys 


Ala 


His 


Asn 


Glu 


Val 


Gly 


Thr Ala Gin 


Gys Asn 


Val 


Thr 


Leu 


Glu 


225 










230 






235 








240 


Val 


Ser 


Thr 


Gly 


Pro Gly Ala 


Ala Val Val 


Ala Glu 


Ala 


Val 


Val 


Gly 










245 






250 








255 


Thr 


Leu 


Val 


Gly 


Leu Gly Leu 


Leu Ala Gly 


Leu Val 


Leu 


Leu 


Tyr 


His 








260 








265 






270 






Arg 


Arg 


Gly 


Lys 


Ala 


Leu 


Glu 


Glu Pro Ala 


Asn Asp 


He 


Lys 


Glu 


Asp 






275 










280 




285 








Ala 


He 


Ala 


Pro 


Arg Thr Leu 


Pro Trp Pro 


Lys Ser 


Ser 


Asp 


Thr 


He 




290 










295 




300 










Ser 


Lys 


Asn 


Gly 


Thr 


Leu 


Ser 


Ser Val Thr 


Ser Ala Arg 


Ala 


Leu 


Arg 


305 










310 






315 








320 


Pro 


Pro 


His 


Gly 


Pro 


Pro Arg 


Pro Gly Ala 


Leu Thr 


Pro 


Thr 


Pro 


Ser 










325 






330 








335 




Leu 


Ser 


Ser 


Gin 


Ala 


Leu 


Pro 


Ser Pro Arg 


His Ala 


His 


Asp 


Arg 


Trp 








340 








345 






350 






Gly 


Pro 


Pro 


Ser 


Thr 


Asn 


He 


Pro His Pro 


Trp Trp Gly 


Phe 


Phe 


Leu 






355 










360 




365 








Trp 


Leu 

























370 



<210> 139 

<211> 1869 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modif ied_base 

<222> all "n" positions 

<223> n=a, c, g, or t 

<400> 139 

gtcgacccac gcgtncntcc agcgtncgga gccgccctgg gtgtcagcgg ctcggctccc 60 
gcgcacgctc cggccgtcgc gcagcctcgg cacctgcagg tccgtgcgtc ccgcggctgg 120 
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cgcccctgac tccgtcccgg ccagggaggg ccatgatttc cctcccgggg cccctggtga 180 

ccaacttgnt gcggtttttg ttcctggggc tgagtgccct cgcgcccccc tcgcgggccc 240 

agctgcaact gcacttgccc gccaaccggt tgcaggcggt ggaggagggg gaaagtggtg 300 

cttcagcatg gtacaccttg cacagggagg tgtcttcatc ccagccatgg gaggtgccct 360 

ttgtgatgtg gttcttcaaa cagaaagaaa aggaggatca ggtgttgtcc tacatcaatg 420 

gggtcacaac aagcaaacct ggagtatcct tggtctactc catgccctcc cggaacctgt 4 80 

ccctgcgggt ggagggtctc caggagaaag actctggccc ctacagctgc tccgtgaatg 54 0 

tgcaagacaa acaaggcaaa tctaggggcc acagcatcaa aaccttagaa ctcaatgtac 600 

tggttcctcc agctcctcca tcctgccgta tccagggtgt gccccatgtg ggggcaaacg 660 

tgaccctgag ctgccagtct ccaaggagta agcccgctgt ccaataccag tgggatcggc 720 

agcttccatc cttccagact ttctttgcac cagcattaga tgtcatccgt gggtctttaa 780 

gcctcaccaa cctttcgtct tccatggctg gagtctatgt ctgcaaggcc cacaatgagg 840 

tgggcactgc ccaatgtaat gtgacgctgg aagtgagcac agggcctgga gctgcagtgg 90 0 

ttgctgaagc tgttgtgggt accctggttg gactggggtt gctggctggg ctggtcctct 960 

tgtaccaccg ccggggcaag gccctggagg agccagccaa tgatatcaag gaggatgcca 1020 

ttgctccccg gaccctgccc tggcccaaga gctcagacac aatctccaag aatgggaccc 1080 

tttcctctgt cacctccgca cgagccctcc ggccacccca tggccctccc aggcctggtg 1140 

cattgacccc cacgcccagt ctatccagcc aggccctgcc ctcaccaaga catgcccacg 1200 

acagatgggg cccaccctca accaatatcc cccatccctg gtggggtttt ttcctttggc 1260 

tttgagccgc atgggtgctg ngcctgtgat ggngcctgcc cagagtcaag ctggctctct 1320 

ggtatgatga ccccaccact cattggctaa aggatttggg gtctctcctt cctataaggg 1380 

tcacctctag cacagaggcc tgagtcatgg gaaagagtca cactcctgac ccttagtact 1440 

ctgcccccac ctctctttac tgtgggaaaa ccatctcagt aagacctaag tgtccaggag 1500 

acagaaggag aagaggaagt ggatctggaa ttgggaggag cctccaccca cccctgactc 1560 

ctccttatga agccagctgc tgaaattagc tactcaccaa gagtgagggg cagagacttc 1620 

cagtcactga gtctcccagg cccccttgat ctgtacccca cccctatcta acaccaccct 1680 ' 

tggctcccac tccagctccc tgtattgata taacctgtca ggctggcttg gttaggtttt 1740 

actggggcag aggataggga atctcttatt aaaactaaca tgaaatatgt gttgttttca 1800 

tttgcaaatt taaataaaga tacataatgt ttgtatgaga taagaaaaaa aaaaaaaaag 1860 

ggcggccgc 1869 

<210> 140 
<211> 370 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid 



<400> 140 



Met 


He 


Ser 


Leu 


Pro 


Gly 


Pro 


Leu 


Val Thr Asn Leu Xaa Arg Phe Leu 


1 








5 








10 15 


Phe 


Leu 


Gly 


Leu 


Ser 


Ala 


Leu Ala 


Pro Pro Ser Arg Ala Gin Leu Gin 








20 










25 30 


Leu 


His 


Leu 


Pro 


Ala 


Asn 


Arg Leu 


Gin Ala Val Glu Glu Gly Glu Ser 






35 










40 


45 


Gly 


Ala 
50 


Ser 


Ala 


Trp 


Tyr 


Thr 
55 


Leu 


His Arg Glu Val Ser Ser Ser Gin 
60 


Pro 


Trp 


Glu 


Val 


Pro 


Phe 


Val 


Met 


Trp Phe Phe Lys Gin Lys Glu Lys 


65 










70 






75 80 


Glu 


Asp 


Gin 


Val 


Leu 
85 


Ser 


Tyr 


He 


Asn Gly Val Thr Thr Ser Lys Pro 
90 95 


Gly 


Val 


Ser 


Leu 
100 


Val 


Tyr 


Ser 


Met 


Pro Ser Arg Asn Leu Ser Leu Arg 
105 no 


Val 


Glu 


Gly 
115 


Leu 


Gin 


Glu 


Lys 


Asp 
120 


Ser Gly Pro Tyr Ser Cys Ser Val 
125 
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Asn 


Val 


Gin 


Asp 


Lys 


Gin 


Gly Lys 


Ser 


Arg 


Gly His 


Ser 


He 


Lys 


Thr 




130 










135 








140 










Leu 


Glu 


Leu 


Asn 


Val 


Leu 


Val 


Pro 


Pro 


Ala 


Pro Pro 


Ser 


Cys 


Arg 


Leu 


145 










150 










155 






160 


Gin 


Gly 


Val 


Pro 


His 


Val 


Gly Ala 


Asn 


Val 


Thr Leu 


Ser 


Cys 


Gin 


Ser 










165 










170 








175 




Pro 


Arg 


Ser 


Lys 
180 


Pro 


Val 


Val 


Gin 


Tyr 
185 


Gin 


Trp Asp 


Arg 


Gin 
190 


Leu 


Pro 


Ser 


Phe 


Gin 
195 


Thr 


Phe 


Phe 


Ala 


Pro 
200 


Ala 


Leu 


Asp Val 


He 
205 


Arg 


Gly 


Ser 


Leu 


Ser 


Leu 


Thr 


Asn 


Leu 


Ser 


Ser 


Ser. 


Met 


Ala Gly 


Val 


Tyr 


Val 


Cys 




210 










215 








220 








Lys 


Ala 


His 


Asn 


Glu 


Val 


Gly Thr 


Ala 


Gin 


Cys Asn 


Val 


Thr 


Leu 


Glu 


nor 

225 










230 










235 








240 


Val 


Ser 


Thr 


Gly 


Pro Gly 


Ala 


Ala 


Val 


Val 


Ala Glu 


Ala 


Val 


Val 


Gly 










245 










250 








255 


Thr 


Leu 


Val 


Gly 


Leu Gly 


Leu 


Leu 


Ala 


Gly 


Leu Val 


Leu 


Leu 


Tyr 


His 








260 










265 








270 






Arg 


Arg 


Gly 


Lys 


Ala 


Leu 


Glu 


Glu 


Pro 


Ala 


Asn Asp 


He 


Lys 


Glu 


Asp 






275 










280 








285 






Ala 


lie 


Ala 


Pro 


Arg Thr 


Leu 


Pro 


Trp 


Pro 


Lys Ser 


Ser 


Asp 


Thr 


He 




290 










295 








300 










Ser 


Lys 


Asn 


Gly 


Thr 


Leu 


Ser 


Ser 


Val 


Thr 


Ser Ala 


Arg 


Ala 


Leu 


Arg 


305 










310 










315 








320 


Pro 


Pro 


His 


Gly 


Pro 
325 


Pro 


Arg 


Pro 


Gly 


Ala 
330 


Leu Thr 


Pro 


Thr 


Pro 
335 


Ser 


Leu 


Ser 


Ser 


Gin 
340 


Ala 


Leu 


Pro 


Ser 


Pro 
345 


Arg 


His Ala 


His 


Asp 
350 


Arg 


Trp 


Gly Pro 


Pro 


Ser 


Thr 


Asn 


lie 


Pro 


His 


Pro 


Trp Trp 


Gly 


Phe 


Phe 


Leu 






355 










360 








365 









Trp Leu 
370 



<210> 141 

<211> 1869 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modif iedjbase 
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 141 

gtcgacccac gcgtncntcc agcgtncgga gccgccctgg gtgtcagcgg ctcggctccc 60 

gcgcacgctc cggccgtcgc gcagcctcgg cacctgcagg tccgtgcgtc ccgcggctgg 120 

cgcccctgac tccgtcccgg ccagggaggg ccatgatttc cctcccgggg cccctggtga 180 

ccaacttgnt gcggtttttg ttcctggggc tgagtgccct cgcgcccccc tcgcgggccc 240 

agctgcaact gcacttgccc gccaaccggt tgcaggcggt ggaggagggg gaaagtggtg 300 

cttcagcatg gtacaccttg cacagggagg tgtcttcatc ccagccatgg gaggtgccct 360 

ttgtgatgtg gttcttcaaa cagaaagaaa aggaggatca ggtgttgtcc tacatcaatg 420 

gggtcacaac aagcaaacct ggagtatcct tggtctactc catgccctcc cggaacctgt 480 

ccctgcgggt ggagggtctc caggagaaag actctggccc ctacagctgc tccgtgaatg 540 

tgcaagacaa acaaggcaaa tctaggggcc acagcatcaa aaccttagaa ctcaatgtac 600 

tggttcctcc agctcctcca tcctgccgtc tccagggtgt gccccatgtg ggggcaaacg 660 

tgaccctgag ctgccagtct ccaaggagta agcccgttgt ccaataccag tgggatcggc 720 

agcttccatc cttccagact ttctttgcac cagcattaga tgtcatccgt gggtctttaa 780 

gcctcaccaa cctttcgtct tccatggctg gagtctatgt ctgcaaggcc cacaatgagg 840 
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tgggcactgc ccaatgtaat gtgacgctgg aagtgagcac agggcctgga gctgcagtgg 900 

ttgctgaagc tgttgtgggt accctggttg gactggggtt gctggctggg ctggtcctct 960 

tgtaccaccg ccggggcaag gccctggagg agccagccaa tgatatcaag gaggatgcca 1020 

ttgctccccg gaccctgccc tggcccaaga gctcagacac aatctccaag aatgggaccc 1080 

tttcctctgt cacctccgca cgagccctcc ggccacccca tggccctccc aggcctggtg 1140 

cattgacccc cacgcccagt ctatccagcc aggccctgcc ctcaccaaga catgcccacg 1200 

acagatgggg cccaccctca accaatatcc cccatccctg gtggggtttt ttcctttggc 1260 

tttgagccgc atgggtgctg ngcctgtgat ggngcctgcc cagagtcaag ctggctctct 1320 

ggtatgatga ccccaccact cattggctaa aggatttggg gtctctcctt cctataaggg 13 80 

tcacctctag cacagaggcc tgagtcatgg gaaagagtca cactcctgac ccttagtact 144 0 

ctgcccccac ctctctttac tgtgggaaaa ccatctcagt aagacctaag tgtccaggag 1500 

acagaaggag aagaggaagt ggatctggaa ttgggaggag cctccaccca cccctgactc 1560 

ctccttatga agccagctgc tgaaattagc tactcaccaa gagtgagggg cagagacttc 1620 

cagtcactga gtctcccagg cccccttgat ctgtacccca cccctatcta acaccaccct 1680 

tggctcccac tccagctccc tgtattgata taacctgtca ggctggcttg gttaggtttt 1740 

actggggcag aggataggga atctcttatt aaaactaaca tgaaatatgt gttgttttca 1800 

tttgcaaatt taaataaaga tacataatgt ttgtatgaga taagaaaaaa aaaaaaaaag 1860 

ggcggccgc 1869 

<210> 142 
<211> 394 
<212> PRT 

<213> Mus musculus 



<400> 142 



Met lie 


Leu 


Gin 


Ala 


Glv 


J- 11 JL 


IT l*\J 


VJl LI 


j. hi. 


Ser 


Leu 


Leu 


Arg 


val 


Leu 


1 






5 










10 














Phe Leu 


Glv 


Leu 


Ser 


Thr 




Ala 


Ala 


rue 


Ser Arg 


ax a 




Mot- 


blU 






20 










25 










30 






Leu His 


Val 


Pro 


Pro 


Gly 


Leu 


Asn 


Lys 


Leu 


Glu 


Ala 


Val 


Glu 


Gly 


Glu 




35 










40 










45 








Glu Val 


Val 


Leu 


Pro 


Ala 


Trp 


Tyr 


Thr 


Met 


Ala Arg 


Glu 


Glu 


Ser 


Trp 


50 










55 










60 








Ser His 


Pro 


Arg 


Glu 


Val 


Pro 


He 


Met 


He 


Trp 


Phe 


Leu 


Glu 


Gin 


Glu 


65 








70 










75 










80 


Gly Lys 


Glu 


Pro 


Asn 


Gin 


Val 


Leu 


Ser 


Tyr 


He 


Asn 


Gly 


Val 


Met 


Thr 








85 










90 










95 




Asn Lys 


Pro 


Gly 


Thr 


Ala 


Leu 


Val 


His 


Ser 


He 


Ser 


Ser 


Arg 


Asn 


Val 






100 










105 










110 






Ser Leu 


Arg 


Leu 


Gly 


Ala 


Leu 


Gin 


Glu 


Gly 


Asp 


Ser 


Gly 


Thr 


Tyr 


Arg 




115 










120 










125 








Cys Ser 


Val 


Asn 


Val 


Gin 


Asn 


Asp 


Glu 


Gly 


Lys 


Ser 


He 


Gly 


His 


Ser 


130 










135 










140 










lie Lys 


Ser 


He 


Glu 


Leu 


Lys 


Val 


Leu 


Val 


Pro 


Pro 


Ala 


Pro 


Pro 


Ser 


145 








150 










155 










160 


Cys Ser 


Leu 


Gin 


Gly 


Val 


Pro 


Tyr 


Val 


Gly 


Thr 


Asn 


Val 


Thr 


Leu 


Asn 








165 










170 










175 




Cys Lys 


Ser 


Pro 


Arg 


Ser 


Lys 


Pro 


Thr 


Ala 


Gin Tyr 


Gin 


Trp 


Glu 


Arg 






180 










185 










190 






Leu Ala 


Pro 


Ser 


Ser 


Gin 


Val 


Phe 


Phe 


Gly 


Pro 


Ala 


Leu 


Asp 


Ala 


Val 




195 










200 










205 








Arg Gly 


Ser 


Leu 


Lys 


Leu 


Thr 


Asn 


Leu 


Ser 


He 


Ala 


Met 


Ser 


Gly 


Val 


210 










215 










220 










Tyr Val 


Cys 


Lys 


Ala 


Gin 


Asn 


Arg 


Val 


Gly 


Phe 


Ala 


Lys 


Cys 


Asn 


Val 


225 








230 










235 










240 


Thr Leu 


Asp 


Val 


Met 


Thr 


Gly 


Ser 


Lys 


Ala 


Ala 


Val 


Val 


Ala 


Gly 


Ala 








245 










250 










255 




Val Val 


Gly 


Thr 


Phe 


Val 


Gly 


Leu 


Val 


Leu 


lie 


Ala 


Gly 


Leu 


Val 


Leu 



91 



WO 00/78808 



PCT/US00/16883 



260 265 270 



Leu 


Tyr 


Gin 


Arg Arg 


Ser 


Lys 


Thr 


Leu 


Glu 


Glu 


Leu 


Ala 


Asn 


Asp 


He 






275 










280 










285 






Lys 


GlU 


ASp 


Ala 


He 


Ala 


Pro 


Arg 


Thr 


Leu 


Pro 


Trp 


Thr 


Lys 


Gly 


Ser 




290 










295 










300 








Asp 


Thr 


lie 


Ser 


Lys 


Asn 


Gly 


Thr 


Leu 


Ser 


Ser 


Val 


Thr 


Ser 


Ala 


Arg 


305 










310 










315 










320 


Aia 


Lieu 


Arg 


Pro 


Pro Lys 


Ala 


Ala 


Pro 


Pro 


Arg 


Pro 


Gly 


Thr 


Phe 


Thr 










325 










330 










335 




Pro 


Tnr 


Pro 


Ser 


Val 


Ser 


Ser 


Gin 


Ala 


Leu 


Ser 


Ser 


Pro 


Arg 


Leu 


Pro 








340 










345 










350 






Arg 


vai 


Asp 


Glu 


Pro 


Pro 


Pro 


Gin 


Ala 


Val 


Ser 


Leu 


Thr 


Pro 


Gly 


Gly 






355 










360 










365 




val 


Ser 


Ser 


Ser 


Ala 


Leu 


Ser 


Arg 


Met 


Gly 


Ala 


Val 


Pro 


Val 


Met 


Val 




370 










375 










380 










Pro 


Ala 


Gin 


Ser 


Gin 


Ala 


Gly 


Ser 


Leu 


Val 















385 390 

<210> 143 
<211> 1846 
<212> DNA 

<213> Mus musculus 
<400> 143 

gtcgacccac gcgtccggtg cacattcggg ttgccgccgc tcacccacaa cacctgtaga 60 

caccgtgtgt ccaactctcc ctgagtactc cgggccaagg agggccatga ttcttcaggc 120 

tggaaccccc gagaccagct tgctgcgggt tttgttcctg ggactgagta cccttgctgc 180 

cttctcccga gctcagatgg agttgcacgt gcccccgggc ctcaacaaat tggaagcggt 24 0 

agagggagaa gaagtggtgc tccccgcctg gtacacgatg gcacgggagg agtcgtggtc 300 

ccacccccgg gaggtgccca tcatgatctg gttcttggaa caagaaggga aggaaccaaa 360 

ccaggtgttg tcttacatta atggagtcat gacaaataaa cctggaacag ccctggtcca 420 

ctctatctct tcacggaatg tgtccctgcg cctgggggca ctccaggagg gagactctgg 480 

gacttaccgc tgttctgtca atgtgcagaa tgatgaaggc aaaagtatag gccacagcat 54 0 

caaaagcata gagctcaaag tgctggttcc tccagctcct ccatcctgta gtttacaggg 600 

tgtaccctat gtcgggacca atgtgaccct gaactgcaag tccccaagga gtaaacctac 660 

tgctcagtac cagtgggaga ggctggcccc atcctcccag gtcttctttg gaccagcctt 72 0 

agatgctgtt cgtggatctt taaagctcac taacctttcc attgccatgt ctggagtcta 780 

tgtctgcaag gctcaaaaca gagtgggctt tgccaagtgc aacgtgacct tggacgtgat 84 0 

gacagggtcc aaggctgcag tggtcgctgg agcagttgtg ggcacttttg ttgggttggt 900 

gctgatagct gggctggtcc tgttgtacca gcgccggagc aagaccttgg aagagctggc 960 

caatgatatc aaggaagatg ccattgctcc ccggaccttg ccttggacca aaggctcaga 1020 

cacaatctcc aagaatggga cactttcttc ggtcacctca gcacgagctc tgcggccacc 1080 
caaggctgct cctccaagac ctggcacatt tactcccaca cccagtgtct ctagccaggc * 1140 

cctgtcctca ccaagactgc ccagggtaga tgaaccccca cctcaggcag tgtccctgac 1200 

cccaggtggg gtttcttctt ctgctctgag ccgcatgggt gctgtgcctg tgatggtgcc 1260 

tgcacagagt caggctgggt ctcttgtgtg atagcccagg cactcattag ctacatctgg 132 0 

tatctgacct ttctgtaaag gtctccttgt ggcacagagg actcaatctt gggaggatgc 1380 

ccacattcta gacctccagt cctttgctcc tacctccttc tattgttgga atactgggcc 1440 

tcagtaagac taaaatctgg gtcaaaggac aaaaggagga aatggacctg aggtaggggg 1500 

ttgggagtga ggaggcttca cttcctccct gcttctccct gaagccagat gaatgctgcg 1560 

gaagatcggc taccctccaa gggctctgga ggagactgcc agtcagtgat gcccctggct 162 0 

ctgtgatctg tacaacaccc ttatctaatg ctgtcctttg ccgttcgctc catctccctg 1680 

tattaatata acctgtcctg ctggcttggc tgggttttgt tgtagcaggg ggataggaaa 174 0 

gacattttaa aatctgactt gaaattgatg tttttgtttt tattttgcaa atttcaataa 1800 

agatacatcg catttgcatg gaaaaaaaaa aaaaaagggc ggccgc 184 6 

<210> 144 
<211> 394 
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<212> PRT 

<213> Mus musculus 

<400> 144 

Met lie Leu Gin Ala Gly Thr Pro Glu Thr Ser Leu Leu Arg Val Leu 

15 10 15 

Phe Leu Gly Leu Ser Thr Leu Ala Ala Phe Ser Arg Ala Gin Met Glu 

20 25 30 

Leu His Val Pro Pro Gly Leu Asn Lys Leu Glu Ala Val Glu Gly Glu 

35 40 45 

Glu Val Val Leu Pro Ala Trp Tyr Thr Met Ala Arg Glu Glu Ser Trp 

50 55 60 

Ser His Pro Arg Glu Val Pro lie Leu He Trp Phe Leu Glu Gin Glu 
65 70 75 80 

Gly Lys Glu Pro Asn Gin Val Leu Ser Tyr He Asn Gly Val Met Thr 

85 90 95 

Asn Lys Pro Gly Thr Ala Leu Val His Ser He Ser Ser Arg Asn Val 

100 105 110 

Ser Leu Arg Leu Gly Ala Leu Gin Glu Gly Asp Ser Gly Thr Tyr Arg 

115 120 125 

Cys Ser Val Asn Val Gin Asn Asp Glu Gly Lys Ser He Gly His Ser 

130 135 140 

He Lys Ser He Glu Leu Lys Ala Leu Val Pro Pro Ala Pro Pro Ser 
145 150 155 160 

Cys Ser Leu Gin Gly Val Pro Tyr Val Gly Thr Asn Val Thr Leu Asn 

165 170 175 

Cys Lys Ser Pro Arg Ser Lys Pro Thr Ala Gin Tyr Gin Trp Glu Arg 

180 185 * 190 

Leu Ala Pro Ser Ser Gin Val Phe Phe Gly Pro Ala Leu Asp Ala Val 

195 200 205 

Arg Gly Ser Leu Lys Leu Thr Asn Leu Ser He Ala Met Ser Gly Val 

210 215 220 

Tyr Val Cys Lys Ala Gin Asn Arg Val Gly Phe Ala Lys Cys Asn Val 
225 230 235 240 

Thr Leu Asp Val Met Thr Gly Ser Lys Ala Ala Val Val Ala Gly Ala 

245 250 255 

Val Val Gly Thr Phe Val Gly Leu Val Leu He Ala Gly Leu Val Leu 

260 265 270 

Leu Tyr Gin Arg Arg Ser Lys Thr Leu Glu Glu Leu Ala Asn Asp He 

275 280 285 

Lys Glu Asp Ala He Ala Pro Arg Thr Leu Pro Trp Thr Lys Gly Ser 

290 295 300 

Asp Thr He Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg 
305 310 315 320 

Ala Leu Arg Pro Pro Lys Ala Ala Pro Pro Arg Pro Gly Thr Phe Thr 

325 330 335 

Pro Thr Pro Ser Val Ser Ser Gin Ala Leu Ser Ser Pro Arg Leu Pro 

340 345 350 

Arg Val Asp Glu Pro Pro Pro Gin Ala Val Ser Leu Thr Pro Gly Gly 

355 360 365 

Val Ser Ser Ser Ala Leu Ser Arg Met Gly Ala Val Pro Val Met Val 

370 375 380 

Pro Ala Gin Ser Gin Ala Gly Ser Leu Val 
385 390 



<210> 145 
<211> 1846 
<212> DNA 
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<213> Mus musculus 
<400> 145 

gtcgacccac gcgtccggtg cacattcggg ttgccgccgc tcacccacaa cacctgtaga 60 

caccgtgtgt ccaactctcc ctgagtactc cgggccaagg agggccatga ttcttcaggc 120 

tggaaccccc gagaccagct tgctgcgggt tttgttcctg ggactgagta cccttgctgc 180 

cttctcccga gctcagatgg agttgcacgt gcccccgggc ctcaacaaat tggaagcggt 240 

agagggagaa gaagtggtgc tccccgcctg gtacacgatg gcacgggagg agtcgtggtc 3 00 

ccacccccgg gaggtgccca tcctgatctg gttcttggaa caagaaggga aggaaccaaa 360 

ccaggtgttg tcttacatta atggagtcat gacaaataaa cctggaacag ccctggtcca 420 

ctctatctct tcacggaatg tgtccctgcg cctgggggca ctccaggagg gagactctgg 4 80 

gacttaccgc tgttctgtca atgtgcagaa tgatgaaggc aaaagtatag gccacagcat 540 

caaaagcata gagctcaaag cgctggttcc tccagctcct ccatcctgta gtttacaggg 600 

tgtaccctat gtcgggacca atgtgaccct gaactgcaag tccccaagga gtaaacctac 660 

tgctcagtac cagtgggaga ggctggcccc atcctcccag gtcttctttg gaccagcctt 720 

agatgctgtt cgtggatctt taaagctcac taacctttcc attgccatgt ctggagtcta 780 

tgtctgcaag gctcaaaaca gagtgggctt tgccaagtgc aacgtgacct tggacgtgat 84 0 

gacagggtcc aaggctgcag tggtcgctgg agcagttgtg ggcacttttg ttgggttggt 900 

gctgatagct gggctggtcc tgttgtacca gcgccggagc aagaccttgg aagagctggc 960 

caatgatatc aaggaagatg ccattgctcc ccggaccttg ccttggacca aaggctcaga 1020 

cacaatctcc aagaatggga cactttcttc ggtcacctca gcacgagctc tgcggccacc 1080 

caaggctgct cctccaagac ctggcacatt tactcccaca cccagtgtct ctagccaggc 114 0 

cctgtcctca ccaagactgc ccagggtaga tgaaccccca cctcaggcag tgtccctgac 1200 

cccaggtggg gtttcttctt ctgctctgag ccgcatgggt gctgtgcctg tgatggtgcc 1260 

tgcacagagt caggctgggt ctcttgtgtg atagcccagg cactcattag ctacatctgg 132 0 

tatctgacct ttctgtaaag gtctccttgt ggcacagagg actcaatctt gggaggatgc 1380 

ccacattcta gacctccagt cctttgctcc tacctccttc tattgttgga atactgggcc 1440 

tcagtaagac taaaatctgg gtcaaaggac aaaaggagga aatggacctg aggtaggggg 1500 

ttgggagtga ggaggcttca cttcctccct gcttctccct gaagccagat gaatgctgcg 1560 

gaagatcggc taccctccaa gggctctgga ggagactgcc agtcagtgat gcccctggct 1620 

ctgtgatctg tacaacaccc ttatctaatg ctgtcctttg ccgttcgctc catctccctg 1680 

tattaatata acctgtcctg ctggcttggc tgggttttgt tgtagcaggg ggataggaaa 1740 

gacattttaa aatctgactt gaaattgatg tttttgtttt tattttgcaa atttcaataa 1800 

agatacatcg catttgcatg gaaaaaaaaa aaaaaagggc ggccgc 1846 

<210> 146 
<211> 394 
<212> PRT 
<213> Mus musculus 



<400> 146 



Met 


lie 


Leu 


Gin 


Ala 


Gly 


Thr 


Pro 


Glu 


Thr 


Ser 


Leu 


Leu Arg Val Leu 


1 








5 










10 






15 


Phe 


Leu 


Gly 


Leu 


Ser 


Thr 


Leu 


Ala 


Ala 


Phe 


Ser 


A ^g 


Ala Gin Met Glu 








20 










25 








30 


Leu 


His 


Val 


Pro 


Pro 


Gly 


Leu 


Asn 


Lys 


Leu 


Glu 


Ala 


Val Glu Gly Glu 






35 










40 










45 


Glu 


Val 


Val 


Leu 


Pro 


Ala 


Trp 


Tyr 


Thr 


Met 


Ala 


Arg 


Glu Glu Ser Trp 




50 










55 










60 




Ser 


His 


Pro 


Arg 


Glu 


Val 


Pro 


He 


Leu 


He 


Trp 


Phe 


Leu Glu Gin Glu 


65 










70 










75 




80 


Gly 


Lys 


Glu 


Pro 


Asn 


Gin 


Val 


Leu 


Ser 


Tyr 


He 


Asn 


Gly Val Met Thr 










85 










90 






95 


Asn 


Lys 


Pro 


Gly 


Thr 


Ala 


Leu 


Val 


His 


Ser 


lie. 


Ser 


Ser Arg Asn Val 








100 










105 








110 


Ser 


Leu 


Arg 


Leu 


Gly 


Ala 


Leu 


Gin 


Glu 


Gly 


Asp 


Ser 


Gly Thr Tyr Arg 






115 










120 










125 


Cys 


Ser 


Val 


Asn 


Val 


Gin 


Asn 


Asp 


Glu 


Gly 


Lys 


Ser 


He Gly His Ser 
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130 






135 










140 








lie Lys Ser 


He Glu 


Leu 


Lys 


Val 


Leu 


Val 


Pro 


Pro 


Ala 


Pro 


Pro Ser 


14 5 




150 










155 








i fin 


Cys Ser Leu 


Gin Gly Val 


Pro 


Tyr 


Val 


Gly 


Thr 


Asn 


Val 


Thr 


Leu Asn 




165 










170 










175 


Cys Lys Ser 


Pro Arg 


Ser 


Lys 


Pro 


Thr 


Ala 


Gin Tyr 


Gin 


Tro 


Glu Arg 




180 








185 










190 


Leu Val Pro 


Ser Ser 


Gin 


Val 


Phe 


Phe 


Gly 


Pro 


Ala 


Leu 


Asp 


Ala Val 


195 








200 










205 






Arg Gly Ser 


Leu Lys 


Leu 


Thr 


Asn 


Leu 


Ser 


He 


Ala 


Met 


Ser 


Gly Val 


210 






215 










220 








Tyr Val Cys 


Lys Ala 


Gin 


Asn 


Arg 


Val 


Gly 


Phe 


Ala 


LVS 


Cys 


Asn Val 

noil v %a jl 


225 




230 










235 










Thr Leu Asp 


Val Met 


Thr 


Gly 


Ser Lys 


Ala 


Ala 


Val 


Val 


Ala 


Gly Ala 




245 










250 












Val Val Gly 


Thr Phe 


Val 


Gly 


Leu 


Val 


Leu 


He 


Ala 


Gly 


Leu 


Val Leu 




260 








265 










270 




Leu Tyr Gin 


Arg Arg Ser 


Lys 


Thr 


Leu 


Glu 


Glu 


Leu 


Ala 


Asn 


Asp He 


275 








280 










285 




Lys Glu Asp 


Ala He 


Ala 


Pro 


Arg 


Thr 


Leu 


Pro 


Tro 


Thr 


Lys 


Gly Ser 


290 






295 










300 






Asp Thr lie 


Ser Lys Asn 


Gly 


Thr 


Leu 


Ser 


Ser 


Val 


Thr 


Ser 


AT a Atct 


305 




310 


















J A \J 


Ala Leu Arg 


Pro Pro 


Lys 


Ala 


Ala 


Pro 


Pro 


Arg 


Pro 


Glv 


Thr 


Phe Thr 




325 










330 










335 


Pro Thr Pro 


Ser Val 


Ser 


Ser 


Gin 


Ala 


Leu 


Ser 


Ser 


Pro 


Arcr 


Leu Pro 




340 








345 










~i ~> \j 




Arg Val Asp 


Glu Pro 


Pro 


Pro 


Gin 


Ala 


Val 


Ser 


Leu 




c -L. \J 




355 








360 










a c 




Val Ser Ser 


Ser Ala 


Leu 


Ser 


Arg Met 


Gly 


Ala 


Val 


Pro 


Val 


Met Val 


370 






375 










380 








Pro Ala Gin 


Ser Gin 


Ala 


Gly 


Ser 


Leu 


Val 












385 




390 





















<210> 147 
<211> 1846 
<212> DNA 

<213> Mus musculus 
<400> 147 

gtcgacccac gcgtccggtg cacattcggg ttgccgccgc tcacccacaa cacctgtaga 60 

caccgtgtgt ccaactctcc ctgagtactc cgggccaagg agggccatga ttcttcaggc 120 

tggaaccccc gagaccagct tgctgcgggt tttgttcctg ggactgagta cccttgctgc 180 

cttctcccga gctcagatgg agttgcacgt gcccccgggc ctcaacaaat tggaagcggt 240 

agagggagaa gaagtggtgc tccccgcctg gtacacgatg gcacgggagg agtcgtggtc 300 

ccacccccgg gaggtgccca tcctgatctg gttcttggaa caagaaggga aggaaccaaa 360 

ccaggtgttg tcttacatta atggagtcat gacaaataaa cctggaacag ccctggtcca 420 

ctctatctct tcacggaatg tgtccctgcg cctgggggca ctccaggagg gagactctgg 480 

gacttaccgc tgttctgtca atgtgcagaa tgatgaaggc aaaagtatag gccacagcat 540 

caaaagcata gagctcaaag tgctggttcc tccagctcct ccatcctgta gtttacaggg 600 

tgtaccctat gtcgggacca atgtgaccct gaactgcaag tccccaagga gtaaacctac 660 

tgctcagtac cagtgggaga ggctggtccc atcctcccag gtcttctttg gaccagcctt 720 

agatgctgtt cgtggatctt taaagctcac taacctttcc attgccatgt ctggagtcta 780 

tgtctgcaag gctcaaaaca gagtgggctt tgccaagtgc aacgtgacct tggacgtgat 840 

gacagggtcc aaggctgcag tggtcgctgg agcagttgtg ggcacttttg ttgggttggt 900 

gctgatagct gggctggtcc tgttgtacca gcgccggagc aagaccttgg aagagctggc 960 

caatgatatc aaggaagatg ccattgctcc ccggaccttg ccttggacca aaggctcaga 1020 

cacaatctcc aagaatggga cactttcttc ggtcacctca gcacgagctc tgcggccacc 1080 
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caaggctgct cctccaagac ctggcacatt tactcccaca cccagtgtct ctagccaggc 1140 

cctgtcctca ccaagactgc ccagggtaga tgaaccccca cctcaggcag tgtccctgac 1200 

cccaggtggg gtttcttctt ctgctctgag ccgcatgggt gctgtgcctg tgatggtgcc 1260 

tgcacagagt caggctgggt ctcttgtgtg atagcccagg cactcattag ctacatctgg 1320 

tatctgacct ttctgtaaag gtctccttgt ggcacagagg actcaatctt gggaggatgc 1380 

ccacattcta gacctccagt cctttgctcc tacctccttc tattgttgga atactgggcc 1440 

tcagtaagac taaaatctgg gtcaaaggac aaaaggagga aatggacctg aggtaggggg 1500 

ttgggagtga ggaggcttca cttcctccct gcttctccct gaagccagat gaatgctgcg 1560 

gaagatcggc taccctccaa gggctctgga ggagactgcc agtcagtgat gcccctggct 162 0 

ctgtgatctg tacaacaccc ttatctaatg ctgtcctttg ccgttcgctc catctccctg 1680 

tattaatata acctgtcctg ctggcttggc tgggttttgt tgtagcaggg ggataggaaa 174 0 

gacattttaa aatctgactt gaaattgatg tttttgtttt tattttgcaa atttcaataa 1800 

agatacatcg catttgcatg gaaaaaaaaa aaaaaagggc ggccgc 184 6 

<210> 148 
<211> 394 
<212> PRT 

<213> Mus musculus 
<400> 148 

Met He Leu Gin Ala Gly Thr Pro Glu Thr Ser Leu Leu Arg Val Leu 

1 5 10 15 

Phe Leu Gly Leu Ser Thr Leu Ala Ala Phe Ser Arg Ala Gin Met Glu 

20 25 30 

Leu His Val Pro Pro Gly Leu Asn Lys Leu Glu Ala Val Glu Gly Glu 

35 40 45 

Glu Val Val Leu Pro Ala Trp Tyr Thr Met Ala Arg Glu Glu Ser Trp 

50 55 60 

Ser His Pro Arg Glu Val Pro He Leu He Trp Phe Leu Glu Gin Glu 
65 70 75 80 

Gly Lys Glu Pro Asn Gin Val Leu Ser Tyr He Asn Gly Val Met Thr 

85 90 95 

Asn Lys Pro Gly Thr Ala Leu Val His Ser He Ser Ser Arg Asn Val 

100 105 no 

Ser Leu Arg Leu Gly Ala Leu Gin Glu Gly Asp Ser Gly Thr Tyr Arg 

115 120 125 

Cys Ser Val Asn Val Gin Asn Asp Glu Gly Lys Ser He Gly His Ser 

130 135 140 

He Lys Ser He Glu Leu Lys Val Leu Val Pro Pro Ala Pro Pro Ser 
145 150 155 160 

Cys Ser Leu Gin Gly Val Pro Tyr Val Gly Thr Asn Val Thr Leu Asn 

165 170 175 

Cys Lys Ser Pro Arg Ser Lys Pro Thr Ala Gin Tyr Gin Trp Glu Arg 

180 185 190 

Leu Ala Pro Ser Ser Gin Val Phe Phe Gly Pro Ala Leu Asp Ala Val 

195 200 205 

Arg Gly Ser Leu Lys Leu Thr Asn Leu Ser He Ala Met Ser Gly Val 

210 215 220 

Tyr Val Cys Lys Ala Gin Asn Arg Val Gly Phe Ala Lys Cys Asn Val 
225 230 235 ^ 240 

Thr Leu Asp Val Met Thr Gly Ser Lys Ala Ala Val Val Ala Gly Ala 

245 250 255 

Val Val Gly Thr Phe Val Gly Leu Val Leu He Ala Gly Leu Val Leu 

260 265 270 

Leu Tyr Gin Arg Arg Ser Lys Thr Leu Glu Glu Leu Ala Asn Asp He 

275 280 285 

Lys Glu Asp Ala He Ala Pro Arg Thr Leu Pro Trp Thr Lys Gly Ser 
290 295 300 
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Asp 


Thr 


lie 


Ser 


Lys 


Asn 


Gly 


Thr 


Leu 


Ser Ser Val Thr Ser 


Ala 


Arq 


305 










310 








315 




320 


Ala 


Leu Arg 


Pro Pro Lys 


Ala 


Ala 


Pro Pro Arg Pro Gly Thr 


Phe 


Thr 










325 










330 


335 




Pro 


Thr 


Pro 


Ser 


Val 


Ser 


Ser 


Gin 


Ala 


T.pi •> C^y* Cay Py*o 7A yrt 
UCU OCX. OCX trxw niy 


Leu 


Pro 








340 










345 


350 






Arg 


Val 


Asp 


Glu 


Pro 


Pro 


Pro 


Gin 


Ala 


Val Ser Leu Thr Pro 


Gly Gly 






355 










360 




365 






Val 


Ser 


Ser 


Ser 


Val 


Leu 


Ser 


Arg 


Met 


Gly Ala Val Pro Val 


Met 


Val 




370 










375 






380 






Pro 


Ala 


Gin 


Ser 


Gin 


Ala 


Gly 


Ser 


Leu 


Val 






385 










390 















<210> 149 

<211> 1846 

<212> DNA 

<213> Mus musculus 



<400> 149 

gtcgacccac gcgtccggtg cacattcggg ttgccgccgc tcacccacaa cacctgtaga 60 

caccgtgtgt ccaactctcc ctgagtactc cgggccaagg agggccatga ttcttcaggc 120 

tggaaccccc gagaccagct tgctgcgggt tttgttcctg ggactgagta cccttgctgc 180 

cttctcccga gctcagatgg agttgcacgt gcccccgggc ctcaacaaat tggaagcggt 240 

agagggagaa gaagtggtgc tccccgcctg gtacacgatg gcacgggagg agtcgtggtc 300 

ccacccccgg gaggtgccca tcctgatctg gttcttggaa caagaaggga aggaaccaaa 360 

ccaggtgttg tcttacatta atggagtcat gacaaataaa cctggaacag ccctggtcca 420 

ctctatctct tcacggaatg tgtccctgcg cctgggggca ctccaggagg gagactctgg 4 80 

gacttaccgc tgttctgtca atgtgcagaa tgatgaaggc aaaagtatag gccacagcat 540 

caaaagcata gagctcaaag tgctggttcc tccagctcct ccatcctgta gtttacaggg 600 

tgtaccctat gtcgggacca atgtgaccct gaactgcaag tccccaagga gtaaacctac 660 

tgctcagtac cagtgggaga ggctggcccc atcctcccag gtcttctttg gaccagcctt 720 

agatgctgtt cgtggatctt taaagctcac taacctttcc attgccatgt ctggagtcta 780 

tgtctgcaag gctcaaaaca gagtgggctt tgccaagtgc aacgtgacct tggacgtgat 840 

gacagggtcc aaggctgcag tggtcgctgg agcagttgtg ggcacttttg ttgggttggt 900 

gctgatagct gggctggtcc tgttgtacca gcgccggagc aagaccttgg aagagctggc 960 

caatgatatc aaggaagatg ccattgctcc ccggaccttg ccttggacca aaggctcaga 1020 

cacaatctcc aagaatggga cactttcttc ggtcacctca gcacgagctc tgcggccacc 1080 

caaggctgct cctccaagac ctggcacatt tactcccaca cccagtgtct ctagccaggc 1140 

cctgtcctca ccaagactgc ccagggtaga tgaaccccca cctcaggcag tgtccctgac 12 00 

cccaggtggg gtttcttctt ctgttctgag ccgcatgggt gctgtgcctg tgatggtgcc 1260 

tgcacagagt caggctgggt ctcttgtgtg atagcccagg cactcattag ctacatctgg 1320 

tatctgacct ttctgtaaag gtctccttgt ggcacagagg actcaatctt gggaggatgc 1380 

ccacattcta gacctccagt cctttgctcc tacctccttc tattgttgga atactgggcc 1440 

tcagtaagac taaaatctgg gtcaaaggac aaaaggagga aatggacctg aggtaggggg 1500 

ttgggagtga ggaggcttca cttcctccct gcttctccct gaagccagat gaatgctgcg 1560 

gaagatcggc taccctccaa gggctctgga ggagactgcc agtcagtgat gcccctggct 1620 

ctgtgatctg tacaacaccc ttatctaatg ctgtcctttg ccgttcgctc catctccctg 1680 

tattaatata acctgtcctg ctggcttggc tgggttttgt tgtagcaggg ggataggaaa 1740 

gacattttaa aatctgactt gaaattgatg tttttgtttt tattttgcaa atttcaataa 1800 

agatacatcg catttgcatg gaaaaaaaaa aaaaaagggc ggccgc 1846 

210> 150 
211> 245 
212> PRT 

213> Homo sapiens 
400> 150 

Met Arg Leu Phe Val Arg Pro Ser Val Arg Pro Ala Met Ala Ala Pro 
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1 








5 










10 


15 


Ala 


Pro 


Ser 


Pro 
20 


Trp 


Thr 


Leu 


Ser 


Leu 
25 


Leu 


Leu Leu Leu Leu Leu Pro 
30 


Ser 


Pro 


Gly Ala 


His 


Gly 


Glu 


Leu 


Cys 


Arg 


Pro Phe Gly Glu Asp Asn 






35 










40 






45 


Ser 


lie 
50 


Pro 


Glu 


Ser 


Cys 


Pro 
55 


Asp 


Phe 


Cys 


Cys Gly Ser Cys Ser Ser 
60 


Gin 


Tyr 


Cys 


Cys 


Ser 


Asp 


Val 


Leu 


Lys 


Lys 


He Gin Trp Asn Glu Glu 


65 










70 










75 80 


Met 


Cys 


Pro 


Glu 


Pro 
85 


Glu 


Ser 


Ser 


Arg 


Phe 
90 


Ser Ala His Pro Glu Thr 
95 


Pro 


Glu 


Gin 


Leu 
100 


Gly 


Ser 


Val 


Leu 


Lys 
105 


Tyr 


Gin Ser Ser Leu Asp Ser 
110 


Asp 


Asn 


Met 
115 


Pro 


Gly 


Phe 


Gly 


Ala 
120 


Thr 


Val 


Ala He Gly Leu Thr Val 
125 


Phe 


Val 
130 


Val 


Phe 


He 


Ala 


Thr 
135 


He 


He 


Val 


Cys Phe Thr Cys Ser Cys 
140 


Cys 


Cys 


Leu 


Tyr 


Lys 


Met 


Cys 


Cys 


Arg 


Pro 


Arg Pro Val Val Ser Asn 


145 










150 










155 160 


Thr 


Thr 


Thr 


Thr 


Thr 
165 


Val 


Val 


His 


Thr 


Ala 
170 


Tyr Pro Gin Pro Gin Pro 
175 


Val 


Ala 


Pro 


Ser 
180 


Tyr 


Pro 


Gly 


Pro 


Thr 
185 


Tyr 


Gin Gly Tyr His Pro Met 
190 


Pro 


Pro 


Gin 
195 


Pro 


Gly 


Met 


Pro 


Ala 
200 


Ala 


Pro 


Tyr Pro Thr Gin Tyr Pro 
205 


Pro 


Pro 


Tyr Leu 


Ala 


Gin 


Pro 


Thr 


Gly 


Pro 


Pro Ala Tyr His Glu Thr 




210 










215 








220 


Leu 


Ala 


Gly Ala 


Ser 


Gin 


Pro 


Pro 


Tyr 


Asn 


Pro Ala Tyr Met Asp Pro 


225 










230 










235 240 


Pro 


Lys 


Ala 


Val 


Pro 
245 















<210> 151 

<211> 1801 

<212> DNA 

<213> Homo sapiens 

<400> 151 

gtcgacccac gcgtccggcg gaggttgtgg ctgcaccgtg gtcctgggct tggtcctggg 60 

cttgatgcgt ctgtttgtcc gtccgtccgt ccgtcccgcc atggctgcgc cggcgccctc 120 

tccgtggacc ctttcgctgc tgctgttgtt gctactgccg tctccgggtg cccatggcga 180 

gctgtgcagg cccttcggtg aagacaattc gatcccagag tcctgtcctg acttctgttg 240 

tggctcctgt tccagccaat actgctgctc tgacgtgctg aagaaaatcc agtggaatga 300 

ggaaatgtgc cctgagccag agtccagcag attttccgcc cacccggaga caccagaaca . 360 

gctgggttca gtgctgaagt atcagtccag tcttgacagt gacaacatgc cagggttcgg 420 

agcgaccgtg gccatcggcc tgaccgtctt cgtggtgttt atcgctacca tcattgtgtg 480 

ctttacctgc tcctgctgct gtctatataa gatgtgctgc cgcccacgac ctgtcgtgtc 540 

caacaccaca actactaccg tggttcacac cgcttaccct cagcctcaac ctgtggcccc 600 

cagctatcct ggaccaacat accagggcta ccatcccatg cccccccagc caggaatgcc 660 

agcagcaccc tacccaacgc agtaccctcc accctacctg gcccagccca cagggccacc 720 

agcctatcat gagacgttgg ctggagccag ccagcctcca tacaacccgg cctacatgga 780 

tcccccaaag gcagttccct gagcctgccc ccagcctctt tggctaacat ttgattatgt 84 0 

catgtgtgtg tgagtgctat gcagagttct ttactgctgt ctgtggtgcg tgtgccttgt 900 

ctagacatgt ggcttcctct gctgatgacc aggtaggcac aaatcttacc agtgctggtt 960 

gggaccaatc tgttttcttc ctcacttgaa attgtaattt ctgaaatttc aagtaaatta 1020 

aaaacaatag ggtaggaggt atttcccgct tcaccccaag gtgaccagcc atagcctgcc 1080 

acacatagga gagcaagctt tttgtgggtc catgtcctgc tttggggagt agccagctag 114 0 

ctgctgctat gggtttattc ccagggcttg gctgcattta gctggacaga gaacaagggg . 1200 
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cctcagtggc agtgggtcag tgactgatgt cagagcacac taggcagaga gccccgtccg 1260 

tctccatcag ctgtctgtct ggacggtccc actgtctttc ctgggactat gtagagggcc 1320 

acatgtattc actattcagg ctccagtggc ttccaggcca ggggcctctg tctactacac 1380 

actctggttt ctccctacag tgtcttttta cgattagcca aacatattgc ctgttttttg 1440 

tatccagatg tgtgataatt ggtgaggttg aaatccttgg ttcctggaga acaggaaacc 1500 

tgacctctga cagtccgttt cccttgacac cagcttcata gaatacctga ctcctgtact 1560 

acagtccagt ttgttccagt agcagggaca ccagggccag gggttatctg gaccaagggt 1620 

9ggggtggag agcctggatg gtagctctgg accagatgtg aatgcctcca tattccctgt 1680 

tggttcctgt ttcactggct gttttagttt tgtgttaatt ggtgtttctg agcattcaaa 1740 

ctccgcaccc tcgtttataa taaatgaata tttggaaaaa aaaaaaaaaa aaaaaaaaaa 1800 

a 1801 

<210> 152 
<211> 245 
<212> PRT 
<213> Homo sapiens 

<400> 152 

Met Arg Leu Phe Val Arg Pro Ser Val Arg Pro Ala Met Ala Ala Pro 

15 10 15 

Ala Pro Ser Pro Trp Thr Leu Ser Leu Leu Leu Leu Leu Leu Leu Pro 

20 25 30 

Ser Pro Gly Ala His Gly Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn 

35 40 45 

Ser He Pro Glu Ser Cys Pro Asp Phe Cys Cys Gly Ser Cys Ser Ser 

50 55 60 

Gin Tyr Cys Cys Ser Asp Val Leu Lys Lys He Gin Trp Asn Glu Glu 
65 70 75 80 

Met Cys Pro Glu Pro Glu Ser Ser Arg Phe Ser Ala His Pro Glu Thr 

85 90 95 

Pro Glu Gin Leu Gly Ser Ala Leu Lys Tyr Gin Ser Ser Leu Asp Ser 

100 105 110 

Asp Asn Met Pro Gly Phe Gly Ala Thr Val Ala He Gly Leu Thr Val 

115 120 125 

Phe Val Val Phe He Ala Thr He He Val Cys Phe Thr Cys Ser Cys 

130 135 140 

Cys Cys Leu Tyr Lys Met Cys Cys Arg Pro Arg Pro Val Val Ser Asn 
145 150 155 160 

Thr Thr Thr Thr Thr Ala Val His Thr Ala Tyr Pro Gin Pro Gin Pro 

165 170 175 

Val Ala Pro Ser Tyr Pro Gly Pro Thr Tyr Gin Gly Tyr His Pro Met 

180 185 190 

Pro Pro Gin Pro Gly Met Pro Ala Ala Pro Tyr Pro Thr Gin Tyr Pro 

195 200 205 

Pro Pro Tyr Leu Ala Gin Pro Thr Gly Pro Pro Ala Tyr His Glu Thr 

210 215 220 

Leu Ala Gly Ala Ser Gin Pro Pro Tyr Asn Pro Ala Tyr Met Asp Pro 
225 230 235 240 

Pro Lys Ala Val Pro 
245 

<210> 153 

<211> 1801 

<212> DNA 

<213> Homo sapiens 

<400> 153 

gtcgacccac gcgtccggcg gaggttgtgg ctgcaccgtg gtcctgggct tggtcctggg 60 
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cttgatgcgt ctgtttgtcc gtccgtccgt ccgtcccgcc atggctgcgc cggcgccctc 120 
tccgtggacc ctttcgctgc tgctgttgtt gctactgccg tctccgggtg cccatggcga 180 
gctgtgcagg cccttcggtg aagacaattc gatcccagag tcctgtcctg acttctgttg 240 
tggctcctgt tccagccaat actgctgctc tgacgtgctg aagaaaatcc agtggaatga 3 00 
ggaaatgtgc cctgagccag agtccagcag attttccgcc cacccggaga caccagaaca 360 
gctgggttca gcgctgaagt atcagtccag tcttgacagt gacaacatgc cagggttcgg 420 
agcgaccgtg gccatcggcc tgaccgtctt cgtggtgttt atcgctacca tcattgtgtg 4 80 
ctttacctgc tcctgctgct gtctatataa gatgtgctgc cgcccacgac ctgtcgtgtc 540 
caacaccaca actactaccg cggttcacac cgcttaccct cagcctcaac ctgtggcccc 600 
cagctatcct ggaccaacat accagggcta ccatcccatg cccccccagc caggaatgcc 660 
agcagcaccc tacccaacgc agtaccctcc accctacctg gcccagccca cagggccacc 720 
agcctatcat gagacgttgg ctggagccag ccagcctcca tacaacccgg cctacatgga 780 
tcccccaaag gcagttccct gagcctgccc ccagcctctt tggctaacat ttgattatgt 840 
catgtgtgtg tgagtgctat gcagagttct ttactgctgt ctgtggtgcg tgtgccttgt 900 
ctagacatgt ggcttcctct gctgatgacc aggtaggcac aaatcttacc agtgctggtt 960 

gggaccaatc tgttttcttc ctcacttgaa attgtaattt ctgaaatttc aagtaaatta 1020 

aaaacaatag ggtaggaggt atttcccgct tcaccccaag gtgaccagcc atagcctgcc 1080 

acacatagga gagcaagctt tttgtgggtc catgtcctgc tttggggagt agccagctag 1140 

ctgctgctat gggtttattc ccagggcttg gctgcattta gctggacaga gaacaagggg 1200 

cctcagtggc agtgggtcag tgactgatgt cagagcacac taggcagaga gccccgtccg 1260 

tctccatcag ctgtctgtct ggacggtccc actgtctttc ctgggactat gtagagggcc 1320 

acatgtattc actattcagg ctccagtggc ttccaggcca ggggcctctg tctactacac 1380 

actctggttt ctccctacag tgtcttttta cgattagcca aacatattgc ctgttttttg 1440 

tatccagatg tgtgataatt ggtgaggttg aaatccttgg ttcctggaga acaggaaacc 1500 

tgacctctga cagtccgttt cccttgacac cagcttcata gaatacctga ctcctgtact 1560 

acagtccagt ttgttccagt agcagggaca ccagggccag gggttatctg gaccaagggt 1620 

9ggggtggag agcctggatg gtagctctgg accagatgtg aatgcctcca tattccctgt 1680 

tggttcctgt ttcactggct gttttagttt tgtgttaatt ggtgtttctg agcattcaaa 1740 

ctccgcaccc tcgtttataa taaatgaata tttggaaaaa aaaaaaaaaa aaaaaaaaaa 1800 

a 1801 

<210> 154 
<211> 245 
<212> PRT 
<213> Homo sapiens 



<400> 154 



Met 


Arg 


Leu 


Phe 


Val 


Arg 


Pro 


Ser 


Val 


Arg 


Pro 


Ala 


Met 


Ala 


Ala 


Pro 


1 








5 










10 










15 




Ala 


Pro 


Ser 


Pro 
20 


Trp 


Thr 


Leu 


Ser 


Leu 
25 


Leu 


Leu 


Leu 


Leu 


Leu 
30 


Leu 


Pro 


Ser 


Pro 


Gly 
35 


Ala 


His 


Gly 


Glu 


Leu 
40 


Cys 


Arg 


Pro 


Phe 


Gly 
45 


Glu 


Asp 


Asn 


Ser 


He 


Pro 


Glu 


Ser 


Cys 


Pro 


Asp 


Phe 


Cys 


Cys Gly 


Ser 


Cys 


Ser 


Ser 




50 










55 










60 










Gin 


Tyr 


Cys 


Cys 


Ser 


Asp 


Val 


Leu 


Lys 


Lys 


He 


Gin 


Trp 


Asn 


Glu 


Glu 


65 










70 










75 










80 


Met 


Cys 


Pro 


Glu 


Pro 
85 


Glu 


Ser 


Ser 


Arg 


Phe 
90 


Ser 


Ala 


His 


Pro 


Glu 
95 


Thr 


Pro 


Glu 


Gin 


Leu 
100 


Gly 


Ser 


Ala 


Leu 


Lys 
105 


Tyr 


Gin 


Ser 


Ser 


Leu 
110 


Asp 


Ser 


Asp 


Asn 


Met 
115 


Pro 


Gly 


Phe 


Gly 


Ala 
120 


Thr 


Val 


Ala 


He 


Gly 
125 


Leu 


Thr 


Val 


Phe 


Val 


Val 


Phe 


He 


Ala 


Thr 


He 


He 


Val 


Cys 


Phe 


Thr 


Cys 


Ser Cys 




130 










135 










140 








Cys 


Cys 


Leu 


Tyr 


Lys 


Met 


Cys 


Cys 


Arg 


Pro 


Arg 


Pro 


Val 


Val 


Ser 


Asn 


145 










150 










155 










160 


Thr 


Thr 


Thr 


Thr 


Thr 


Val 


Val 


His 


Thr 


Ala 


Tyr 


Pro 


Gin 


Pro 


Gin 


Pro 
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165 

X Q J 






Val Ala Pro Ser 


Tvr 


Pro 


Glv Ptt> Thr 


180 






IOj 


Pro Pro Gin Pro 


Glv 


Met 


Pro Ala Val 










Pro Pro Tyr Leu 


Ala 


Gin 


Pro Thr Gly 










Leu Ala Gly Ala 


Ser 


Gin 


Pro Pro Tyr 


225 




230 




Pro Lys Ala Val 


Pro 
245 
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170 175 
Tyr Gin Gly Tyr His Pro Met 
190 

Pro Tyr Pro Thr Gin Tyr Pro 
205 

Pro Pro Ala Tyr His Glu Thr 
220 

Asn Pro Ala Tyr Met Asp Pro 
235 240 



<210> 155 
<211> 1801 
<212> DNA 

<213> Homo sapiens 
<400> 155 

gtcgacccac gcgtccggcg gaggttgtgg ctgcaccgtg gtcctgggct tggtcctggg 60 

cttgatgcgt ctgtttgtcc gtccgtccgt ccgtcccgcc atggctgcgc cggcgccctc 120 

tccgtggacc ctttcgctgc tgctgttgtt gctactgccg tctccgggtg cccatggcga 180 

gctgtgcagg cccttcggtg aagacaattc gatcccagag tcctgtcctg acttctgttg 240 

tggctcctgt tccagccaat actgctgctc tgacgtgctg aagaaaatcc agtggaatga 300 

ggaaatgtgc cctgagccag agtccagcag attttccgcc cacccggaga caccagaaca 360 

gctgggttca gcgctgaagt atcagtccag tcttgacagt gacaacatgc cagggttcgg 420 

agcgaccgtg gccatcggcc tgaccgtctt cgtggtgttt atcgctacca tcattgtgtg 480 

ctttacctgc tcctgctgct gtctatataa gatgtgctgc cgcccacgac ctgtcgtgtc 540 

caacaccaca actactaccg tggttcacac cgcttaccct cagcctcaac ctgtggcccc 600 

cagctatcct ggaccaacat accagggcta ccatcccatg cccccccagc caggaatgcc 660 

agcagtaccc tacccaacgc agtaccctcc accctacctg gcccagccca cagggccacc 720 

agcctatcat gagacgttgg ctggagccag ccagcctcca tacaacccgg cctacatgga 780 

tcccccaaag gcagttccct gagcctgccc ccagcctctt tggctaacat ttgattatgt 840 

catgtgtgtg tgagtgctat gcagagttct ttactgctgt ctgtggtgcg tgtgccttgt 900 

ctagacatgt ggcttcctct gctgatgacc aggtaggcad aaatcttacc agtgctggtt 960 

gggaccaatc tgttttcttc ctcacttgaa attgtaattt ctgaaatttc aagtaaatta 1020 

aaaacaatag ggtaggaggt atttcccgct tcaccccaag gtgaccagcc atagcctgcc 1080 

acacatagga gagcaagctt tttgtgggtc catgtcctgc tttggggagt agccagctag 1140 

ctgctgctat gggtttattc ccagggcttg gctgcattta gctggacaga gaacaagggg 1200 

cctcagtggc agtgggtcag tgactgatgt cagagcacac taggcagaga gccccgtccg 1260 

tctccatcag ctgtctgtct ggacggtccc actgtctttc ctgggactat gtagagggcc 1320 

acatgtattc actattcagg ctccagtggc ttccaggcca ggggcctctg tctactacac 1380 

actctggttt ctccctacag tgtcttttta cgattagcca aacatattgc ctgttttttg 144 0 

tatccagatg tgtgataatt ggtgaggttg aaatccttgg ttcctggaga acaggaaacc 1500 

tgacctctga cagtccgttt cccttgacac cagcttcata gaatacctga ctcctgtact 1560 

acagtccagt ttgttccagt agcagggaca ccagggccag gggttatctg gaccaagggt 1620 

gggggtggag agcctggatg gtagctctgg accagatgtg aatgcctcca tattccctgt 1680 

tggttcctgt ttcactggct gttttagttt tgtgttaatt ggtgtttctg agcattcaaa 1740 

ctccgcaccc tcgtttataa taaatgaata tttggaaaaa aaaaaaaaaa aaaaaaaaaa 1800 

a 1801 

<210> 156 
<211> 245 
<212> PRT 

<213> Homo sapiens 
<400> 156 

Met Arg Leu Phe Val Arg Pro Ser Val Arg Pro Ala Met Ala Ala Pro 
15 10 15 
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Ala 


Pro 


Ser 


Pro 


Trp 


Thr 


Leu 


Ser 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Pro 








20 










25 










30 






Ser 


Pro 


Gly Ala 


His 


Gly 


Glu 


Leu 


Cys 


Arg 


Pro 


Phe 


Gly 


Glu 


Asp 


Asn 






35 










40 










45 








Ser 


He 


Pro 


Glu 


Ser 


Cys 


Pro 


Asp 


Phe 


Cys 


Cys 


Gly 


Ser 


Cys 


Ser 


Ser 




50 










55 










60 










Gin 


Tyr 


Cys 


Cys 


Ser 


Asp 


Val 


Leu 


Lys 


Lys 


He 


Gin 


Trp 


Asn 


Glu 


Glu 


65 










70 










75 










80 


Met 


Cys 


Pro 


Glu 


Pro 


Glu 


Ser 


Ser 


Arg 


Phe 


Ser 


Ala 


His 


Pro 


Glu 


Thr 










85 










90 










95 




Pro 


Glu 


Gin 


Leu 


Gly 


Ser 


Ala 


Leu 


Lys 


Tyr 


Gin 


Ser 


Ser 


Leu 


Asp 


Ser 








100 










105 










110 






Asp 


Asn 


Met 


Pro 


Gly 


Phe 


Gly 


Ala 


Thr 


Val 


Ala 


He 


Gly 


Leu 


Thr 


Val 






115 










120 










125 








Phe 


Val 


Val 


Phe 


He 


Ala 


Thr 


He 


He 


Val 


Cys 


Phe 


Thr 


Cys 


Ser 


Cys 




130 










135 










140 








Cys 


Cys 


Leu 


Tyr 


Lys 


Met 


Cys 


Cys 


Arg 


Pro 


Arg 


Pro 


Val 


Val 


Ser 


Asn 


145 










150 










155 










160 


Thr 


Thr 


Thr 


Thr 


Thr 


Val 


Val 


His 


Thr 


Ala 


Tyr 


Pro 


Gin 


Pro 


Gin 


Pro 










165 










170 










175 




Val 


Ala 


Pro 


Ser 


Tyr 


Pro 


Gly 


Pro 


Thr 


Tyr 


Gin 


Gly 


Tyr 


His 


Pro 


Met 








180 










185 










190 






Pro 


Pro 


Gin 


Pro 


Gly 


Met 


Pro 


Ala 


Ala 


Pro 


Tyr 


Pro 


Thr 


Gin 


Tyr 


Pro 






195 










200 










205 








Pro 


Pro 


Tyr Leu 


Ala 


Gin 


Pro 


Thr 


Gly 


Pro 


Pro 


Ala 


Tyr 


His 


Glu 


Thr 




210 










215 










220 










Leu 


Ala 


Gly Ala 


Ser 


Gin 


Pro 


Pro 


Tyr 


Asn 


Pro 


Ala 


Tyr 


Met 


Asp 


Pro 


225 










230 










235 










240 


Pro 


Lys 


Val 


Val 


Pro 

























245 



<210> 157 

<211> 1801 

<212> DNA 

<213> Homo sapiens 

<400> 157 

gtcgacccac gcgtccggcg gaggttgtgg ctgcaccgtg gtcctgggct tggtcctggg 60 

cttgatgcgt ctgtttgtcc gtccgtccgt ccgtcccgcc atggctgcgc cggcgccctc 12 0 

tccgtggacc ctttcgctgc tgctgttgtt gctactgccg tctccgggtg cccatggcga 180 

gctgtgcagg cccttcggtg aagacaattc gatcccagag tcctgtcctg acttctgttg 24 0 

tggctcctgt tccagccaat actgctgctc tgacgtgctg aagaaaatcc agtggaatga 300 

ggaaatgtgc cctgagccag agtccagcag attttccgcc cacccggaga caccagaaca 360 

gctgggttca gcgctgaagt atcagtccag tcttgacagt gacaacatgc cagggttcgg 420 

agcgaccgtg gccatcggcc tgaccgtctt cgtggtgttt atcgctacca tcattgtgtg 480 

ctttacctgc tcctgctgct gtctatataa gatgtgctgc cgcccacgac ctgtcgtgtc 540 

caacaccaca actactaccg tggttcacac cgcttaccct cagcctcaac ctgtggcccc 600 

cagctatcct ggaccaacat accagggcta ccatcccatg cccccccagc caggaatgcc 660 

agcagcaccc tacccaacgc agtaccctcc accctacctg gcccagccca cagggccacc 720 

agcctatcat gagacgttgg ctggagccag ccagcctcca tacaacccgg cctacatgga 780 

tcccccaaag gtagttccct gagcctgccc ccagcctctt tggctaacat ttgattatgt 840 

catgtgtgtg tgagtgctat gcagagttct ttactgctgt ctgtggtgcg tgtgccttgt 900 

ctagacatgt ggcttcctct gctgatgacc aggtaggcac aaatcttacc agtgctggtt 960 

gggaccaatc tgttttcttc ctcacttgaa attgtaattt ctgaaatttc aagtaaatta 1020 

aaaacaatag ggtaggaggt atttcccgct tcaccccaag gtgaccagcc atagcctgcc 1080 

acacatagga gagcaagctt tttgtgggtc catgtcctgc tttggggagt agccagctag 114 0 

ctgctgctat gggtttattc ccagggcttg gctgcattta gctggacaga gaacaagggg 1200 

cctcagtggc agtgggtcag tgactgatgt cagagcacac taggcagaga gccccgtccg 1260 
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tctccatcag ctgtctgtct ggacggtccc actgtctttc ctgggactat gtagagggcc 132 0 

acatgtattc actattcagg ctccagtggc ttccaggcca ggggcctctg tctactacac 13 80 

actctggttt ctccctacag tgtcttttta cgattagcca aacatattgc ctgttttttg 1440 

tatccagatg tgtgataatt ggtgaggttg aaatccttgg ttcctggaga acaggaaacc 1500 

tgacctctga cagtccgttt cccttgacac cagcttcata gaatacctga ctcctgtact 1560 

acagtccagt ttgttccagt agcagggaca ccagggccag gggttatctg gaccaagggt 1620 

99gggtggag agcctggatg gtagctctgg accagatgtg aatgcctcca tattccctgt 1680 

tggttcctgt ttcactggct gttttagttt tgtgttaatt ggtgtttctg agcattcaaa 1740 

ctccgcaccc tcgtttataa taaatgaata tttggaaaaa aaaaaaaaaa aaaaaaaaaa 1800 
a 1801 

<210> 158 
<211> 213 
<212> PRT 
<213> Mus musculus 



<400> 158 


























Met Ala 


Ala 


Pro 


Ala 


Pro 


Ser 


Leu 


Tro 


Thr 


Leu Leu 


Leu 


Leu 


Leu 


T.oi i 


1 






5 










10 








15 




Leu Leu 


Pro 


Pro 


Pro 


Pro 


Gly 


Ala 


His 


Glv 


Glu Leu 


Cys 


Arg 


Pro 


Jrlic 






20 










25 








30 






Gly Glu 


Asp Asn 


Ser 


He 


Pro 


Val 


Phe 


Cys 


Pro Asn 


Phe 


Cys 


Cys 


Gly 




35 










40 


















Ser 


Asn 




TVr 
lyr 






Ser 


Ann 

ASp 


vax Lieu 


Arg 


Lys 


He 


Gin 


50 










CC 
DO 








60 










Trp Asn 


Glu 


Glu 


Met 


Cys 


Pro 


Glu 


Pro 


Glu 


Ser Ser 


Arg 


Phe 


Ser 


Thr 


65 








70 










75 








80 


Pro Ala 


Glu 


Glu 


Thr. 


Pro 


Glu 


His 


Leu 


Gly 


Ser Ala 


Leu 


Lys 


Phe 


Arg 








85 










90 








95 


Ser Ser 


Phe 


Asp 


Ser 


Asp 


Pro 


Met 


Ser 


Gly 


Phe Gly 


Ala 


Thr 


Val 


Ala 






100 










105 








110 






He Gly 


Val 


Thr 


Met 


Phe 


Val 


Val 


Phe 


He 


Ala Thr 


He 


He 


He 


Cys 




115 










120 








125 






Phe Thr 


Cys 


Ser 


Cys 


Cys 


Cys 


Leu 


Tyr 


Lys 


Met Cys 


Cys 


Pro 


Gin 


Arg 


130 










135 








140 








Pro Val 


Val 


Thr 


Asn 


Thr 


Thr 


Thr 


Thr 


Thr 


Val Val 


His 


Ala 


Pro 


Tyr 


145 








150 










155 








160 


Pro Gin 


Pro 


Gin 


Pro 


Gin 


Pro 


Val 


Ala 


Pro 


Ser Tyr 


Pro 


Gly Pro 


Thr 








165 










170 








175 




Tyr Gin 


Gly Tyr 


His 


Pro 


Met 


Pro 


Pro 


Pro 


Ala Arg 


Asn 


Ala 


Ser 


Ser 






180 










185 








190 






Thr Leu 


Pro 


Asn 


Ala 


Val 


Pro 


Thr 


Thr 


Leu 


Pro Gly 


Pro 


Ala 


His 


Arg 




195 










200 








205 






Ala Ala 


Thr 


Leu 


Pro 






















210 





























<210> 159 

<211> 1858 

<212> DNA 

<213> Mus musculus 

<400> 159 

gtcgacccac gcgtccgcgc ggaggttgcg gcggcaccgt ggtcttgggc ttggtccgtc 60 

tgttcgtccg tccgttggtc tgtcccgcca tggctgcgcc ggcgccctct ctgtggaccc 120 

tattgctgct gctgttgctg ctgccgccgc ctccgggtgc ccatggtgag ctgtgcaggc 180 

cctttggtga agacaattcg atcccagtgt tctgtcctga tttctgttgt ggttcctgtt 240 

ccaaccaata ctgctgctcg gacgtgctga ggaaaatcca gtggaatgag gaaatgtgtc 300 

ctgagccaga gtccagcaga ttttccaccc ccgcggagga gacacccgaa catctgggtt 360 
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cagcgctgaa atttcgatcc agttttgaca gtgaccctat gtcagggttc ggagcgaccg 420 

tcgccattgg cgtgaccatg tttgtggtgt ttattgccac tatcatcatc tgcttcacct 4 80 

gctcctgctg ctgtctgtat aagatgtgct gcccccaacg ccctgtcgtg accaacacca 540 

caactactac cgtggttcat gccccttacc ctcagcctca acctcaacct gtggccccca 600 

gctatcctgg accaacatac cagggctacc atcccatgcc ccccccagcc aggaatgcca 660 

gcagcaccct acccaacgca gtacccacca ccctacctgg cccagcccac agggccgcca 720 

ccctaccatg agtccttggc tggagccagc cagcctccat acaacccgac ctacatggat 780 

tccctaaaga caattccctg aacctgcccc cagcctcttt ggctgccatt tatgtcgtgt 840 

gtgagtgagt gatacgcaga gttctttact gctgtctgtg gtgtgtgtgc cttgtctaga 900 

catgtggctt cctctgctgt tgaccaggta ggcgcaagtc ttaccagtgt gggtcgggac 960 

caacctgttt tcttcctcac ttgaaattgt actttctgaa atttcaagca aattaaaaac 1020 

aataaggtag gaggtatttc ccacgtcacc ccaaggtgac cagccatggc ctgtcatact 1080 

taggagagca agctttttgc gggtacagag caggctttgg ggggtaacca gctagctgct 114 0 

gctaggcctt tattcccagg gtttggctgc attggcagtg aggcaggtgg ctgggggtga 1200 

caccaggtga caaggggact cagtggcagg gggtcacacc aggcagaaca ccatacactc 1260 

tccatcagct gtctgtctgg atgtcactgt ccttcccggg gctgtataga gggccacatg 1320 

tgttcactat tcaggctcca ctgggggaat tttcctacct ttgctggctt ggctcctgct 1380 

cccaggccag ggacctcggt ctgtctacta cacactctgg tttctccctg cactgtcttt 1440 

ttactgttag ccaaacattt tgcctgtttt ctgtctccag atgtgtgata attggtgtga 1500 

ggttgaaatc cctggttcct ggaggacaga caacctgacc tccgactgtc agtttccctt 1560 

gacaccatct tcatagaaat acctgactcc tgtaccacag tccagtttgt cccagtagca 1620 

gggacaccaa ggccaatggg ttatctggac caaaggtggg gtggagggcc tagatggtat 1680 

ctccggccca gatgtgaata cctccatatt ccctgttggt tcctgtttca ctggctgttt 1740 

tagctttgtg ttgattggtg tttctgagca ttcagactcc gcaccctcat ttctaataaa 1800 

tgcaacattg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg gcggccgc 1858 

<210> 160 
<211> 213 
<212> PRT 
<213> Mus musculus 

<400> 160 



Met 


Ala 


Ala 


Pro 


Ala 


Pro 


Ser 


Leu 


Trp 


Thr 


Leu Leu 


Leu 


Leu 


Leu 


Leu 


l 








5 










10 








15 




Leu 


Leu 


Pro 


Pro 


Pro 


Pro 


Gly 


Ala 


His 


Gly 


Glu Leu 


Cys 


Arg 


Pro 


Phe 








20 










25 








30 






Gly 


Glu 


Asp 


Asn 


Ser 


He 


Pro 


Val 


Phe 


Cys 


Pro Asp 


Phe 


Cys 


Cys 


Gly 






35 










40 








45 




Ser 


Cys 


Ser 


Asn 


Gin 


Tyr 


Cys 


Cys 


Ser 


Asp 


Val Leu 


Arg 


Lys 


lie 


Gin 




50 










55 








60 










Trp 


Asn 


Glu 


Glu 


Met 


Cys 


Pro 


Glu 


Pro 


Glu 


Ser Ser 


Arg 


Phe 


Ser 


Thr 


65 










70 










75 . 






80 


Pro 


Ala 


Glu 


Glu 


Thr 


Pro 


Glu 


His 


Leu 


Gly 


Ser Ala 


Leu 


Lys 


Phe 


Arg 










85 










90 








95 


Ser 


Ser 


Phe 


Asp 


Ser 


Asp 


Pro 


Met 


Ser 


Gly 


Phe Gly 


Ala 


Thr 


Val 


Ala 








100 










105 








110 






He 


Gly 


Val 


Thr 


He 


Phe 


Val 


Val 


Phe 


He 


Val Thr 


He 


He 


He 


Cys 






115 










120 








125 






Phe 


Thr 


Cys 


Ser 


Cys 


Cys 


Cys 


Leu 


Tyr 


Lys 


Met Cys 


Cys 


Pro 


Gin 


Arg 




130 










135 








140 








Pro 


Val 


Val 


Thr 


Asn 


Thr 


Thr 


Thr 


Thr 


Thr 


Val Val 


His 


Ala 


Pro 


Tyr 


145 










150 










155 








160 


Pro 


Gin 


Pro 


Gin 


Pro 


Gin 


Pro 


Val 


Ala 


Pro 


Ser Tyr 


Pro 


Gly 


Pro 


Thr 










165 










170 






175 




Tyr 


Gin 


Gly 


Tyr 


His 


Pro 


Met 


Pro 


Pro 


Pro 


Ala Arg 


Asn 


Ala 


Ser 


Ser 








180 










185 








190 






Thr 


Leu 


Pro 


Asn 


Ala 


Val 


Pro 


Thr 


Thr 


Leu 


Pro Gly 


Pro 


Ala 


His 


Arg 






195 










200 








205 
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Ala Ala Thr Leu Pro 
210 

<210> 161 
<211> 1858 
<212> DNA 

<213> Mus musculus 
<400> 161 

gtcgacccac gcgtccgcgc ggaggttgcg gcggcaccgt ggtcttgggc ttggtccgtc 60 

tgttcgtccg tccgttggtc tgtcccgcca tggctgcgcc ggcgccctct ctgtggaccc 12 0 

tattgctgct gctgttgctg ctgccgccgc ctccgggtgc ccatggtgag ctgtgcaggc 180 

cctttggtga agacaattcg atcccagtgt tctgtcctga tttctgttgt ggttcctgtt 240 

ccaaccaata ctgctgctcg gacgtgctga ggaaaatcca gtggaatgag gaaatgtgtc 300 

ctgagccaga gtccagcaga ttttccaccc ccgcggagga gacacccgaa catctgggtt 360 

cagcgctgaa atttcgatcc agttttgaca gtgaccctat gtcagggttc ggagcgaccg 420 

tcgccattgg cgtgaccatc tttgtggtgt ttattgtcac tatcatcatc tgcttcacct 480 

gctcctgctg ctgtctgtat aagatgtgct gcccccaacg ccctgtcgtg accaacacca 540 

caactactac cgtggttcat gccccttacc ctcagcctca acctcaacct gtggccccca 600 

gctatcctgg accaacatac cagggctacc atcccatgcc ccccccagcc aggaatgcca 660 

gcagcaccct acccaacgca gtacccacca ccctacctgg cccagcccac agggccgcca 720 

ccctaccatg agtccttggc tggagccagc cagcctccat acaacccgac ctacatggat 780 

tccctaaaga caattccctg aacctgcccc cagcctcttt ggctgccatt tatgtcgtgt 840 

gtgagtgagt gatacgcaga gttctttact gctgtctgtg gtgtgtgtgc cttgtctaga 900 

catgtggctt cctctgctgt tgaccaggta ggcgcaagtc ttaccagtgt gggtcgggac 960 

caacctgttt tcttcctcac ttgaaattgt actttctgaa atttcaagca aattaaaaac 1020 

aataaggtag gaggtatttc ccacgtcacc ccaaggtgac cagccatggc ctgtcatact 1080 

taggagagca agctttttgc gggtacagag caggctttgg ggggtaacca gctagctgct 1140 

gctaggcctt tattcccagg gtttggctgc attggcagtg aggcaggtgg ctgggggtga 1200 

caccaggtga caaggggact cagtggcagg gggtcacacc aggcagaaca ccatacactc 1260 

tccatcagct gtctgtctgg atgtcactgt ccttcccggg gctgtataga gggccacatg 1320 

tgttcactat tcaggctcca ctgggggaat tttcctacct ttgctggctt ggctcctgct 1380 

cccaggccag ggacctcggt ctgtctacta cacactctgg tttctccctg cactgtcttt 1440 

ttactgttag ccaaacattt tgcctgtttt ctgtctccag atgtgtgata attggtgtga 1500 

ggttgaaatc cctggttcct ggaggacaga caacctgacc tccgactgtc agtttccctt 1560 

gacaccatct tcatagaaat acctgactcc tgtaccacag tccagtttgt cccagtagca 1620 

gggacaccaa ggccaatggg ttatctggac caaaggtggg gtggagggcc tagatggtat 1680 

ctccggccca gatgtgaata cctccatatt ccctgttggt tcctgtttca ctggctgttt 1740 

tagctttgtg ttgattggtg tttctgagca ttcagactcc gcaccctcat ttctaataaa 1800 

tgcaacattg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg gcggccgc . 1858 

<210> 162 
<211> 213 
<212> PRT 

<213> Mus musculus 



<400> 162 



Met Ala Ala 


Pro 


Ala 


Pro 


Ser 


Leu 


Trp Thr 


Leu Leu 


Leu 


Leu 


Leu 


Leu 


1 




5 








10 








15 




Leu Leu Pro 


Pro 


Pro 


Pro Gly Ala 


His Gly 


Glu Leu 


Cys 


Arg 


Pro 


Phe 




20 










25 






30 






Gly Qlu Asp 


Asn 


Ser 


He 


Pro 


Val 


Phe Cys 


Pro Asp 


Phe 


Cys 


Cys Gly 


35 










40 






45 








Ser Cys ser 


Asn 


Gin 


Tyr 


Cys 


Cys 


Ser Asp 


Val Leu 


Arg 


Lys 


He 


Gin 


50 








55 






60 










Trp Asn Glu 


Glu 


Met 


Cys 


Pro 


Glu 


Pro Glu 


Ser Ser Arg Phe 


Ser 


Thr 


65 






70 








75 








80 


Pro Ala Glu 


Glu 


Thr 


Pro 


Glu 


His 


Leu Gly 


Ser Ala 


Leu 


Lys 


Phe 


Arg 
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85 








90 




95 




Ser 


Ser 


Phe 


Asp 
100 


Ser 


Asp 


Pro 


Met 


Ser Gly Phe Gly Ala 
105 


Thr 
110 


Val 


Ala 


He 


Gly 


Val 


Thr 


He 


Phe 


Val 


Val 


Phe He Ala Thr He 


He 


He 


Cys 






115 










120 


125 






Phe 


Thr 
130 


Cys 


Ser 


Cys 


Cys 


Cys 
135 


Leu 


Tyr Lys Met Cys Cys 
140 


Pro 


Gin 


Arg 


Pro 


Val 


Val 


Thr 


Asn 


Thr 


Thr 


Thr 


Thr Thr Val Ala His 


Ala 


Pro 


Tyr 


145 










150 






155 






160 


Pro 


Gin 


Pro 


Gin 


Pro 


Gin 


Pro 


Val 


Ala Pro Ser Tyr Pro 


Gly Pro 


Thr 










165 








170 




175 




Tyr 


Gin 


Gly 


Tyr 
180 


His 


Pro 


Met 


Pro 


Pro Pro Ala Arg Asn 
185 


Ala 
190 


Ser 


Ser 


Thr 


Leu 


Pro 


Asn 


Ala 


Val 


Pro Thr Thr Leu Pro Gly Pro 


Ala 


His 


Arg 






195 










200 


205 






Ala 


Ala 
210 


Thr 


Leu 


Pro 

















<210> 163 

<211> 1858 

<212> DNA 

<213> Mus musculus 

<400> 163 

gtcgacccac gcgtccgcgc ggaggttgcg gcggcaccgt ggtcttgggc ttggtccgtc 60 

tgttcgtccg tccgttggtc tgtcccgcca tggctgcgcc ggcgccctct ctgtggaccc 120 

tattgctgct gctgttgctg ctgccgccgc ctccgggtgc ccatggtgag ctgtgcaggc 180 

cctttggtga agacaattcg atcccagtgt tctgtcctga tttctgttgt ggttcctgtt 240 

ccaaccaata ctgctgctcg gacgtgctga ggaaaatcca gtggaatgag gaaatgtgtc 300 

ctgagccaga gtccagcaga ttttccaccc ccgcggagga gacacccgaa catctgggtt 360 

cagcgctgaa atttcgatcc agttttgaca gtgaccctat gtcagggttc ggagcgaccg 420 

tcgccattgg cgtgaccatc tttgtggtgt ttattgccac tatcatcatc tgcttcacct 480 

gctcctgctg ctgtctgtat aagatgtgct gcccccaacg ccctgtcgtg accaacacca 540 

caactactac cgtggctcat gccccttacc ctcagcctca acctcaacet gtggccccca 600 

gctatcctgg accaacatac cagggctacc atcccatgcc ccccccagcc aggaatgcca 660 

gcagcaccct acccaacgca gtacccacca ccctacctgg cccagcccac agggccgcca 720 

ccctaccatg agtccttggc tggagccagc cagcctccat acaacccgac ctacatggat 780 

tccctaaaga caattccctg aacctgcccc cagcctcttt ggctgccatt tatgtcgtgt 840 

gtgagtgagt gatacgcaga gttqtttact gctgtctgtg gtgtgtgtgc cttgtctaga 900 

catgtggctt cctctgctgt tgaccaggta ggcgcaagtc ttaccagtgt gggtcgggac 960 

caacctgttt tcttcctcac ttgaaattgt actttctgaa atttcaagca aattaaaaac 1020 

aataaggtag gaggtatttc ccacgtcacc ccaaggtgac cagccatggc ctgtcatact 1080 

taggagagca agctttttgc gggtacagag caggctttgg ggggtaacca gctagctgct 1140 

gctaggcctt tattcccagg gtttggctgc attggcagtg aggcaggtgg ctgggggtga 1200 

caccaggtga caaggggact cagtggcagg gggtcacacc aggcagaaca ccatacactc 1260 

tccatcagct gtctgtctgg atgtcactgt ccttcccggg gctgtataga gggccacatg 1320 

tgttcactat tcaggctcca ctgggggaat . tttcctacct ttgctggctt ggctcctgct 1380 

cccaggccag ggacctcggt ctgtctacta cacactctgg tttctccctg cactgtcttt 1440 

ttactgttag ccaaacattt tgcctgtttt ctgtctccag atgtgtgata attggtgtga 1500 

ggttgaaatc cctggttcct ggaggacaga caacctgacc tccgactgtc agtttccctt 1560 

gacaccatct tcatagaaat acctgactcc tgtaccacag tccagtttgt cccagtagca 1620 

gggacaccaa ggccaatggg ttatctggac caaaggtggg gtggagggcc tagatggtat 1680 

ctccggccca gatgtgaata cctccatatt ccctgttggt tcctgtttca ctggctgttt 1740 

tagctttgtg ttgattggtg tttctgagca ttcagactcc gcaccctcat ttctaataaa 1800 

tgcaacattg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg gcggccgc 1858 

210> 164 
211> 213 
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<212> PRT 

<213> Mus musculus 
<400> 164 

Met Ala Ala Pro Ala Pro Ser Leu Trp Thr Leu Leu Leu Leu Leu Leu 

1 5 10 15 

Leu Leu Pro Pro Pro Pro Gly Ala His Gly Glu Leu Cys Arg Pro Phe 

20 25 30 

Gly Glu Asp Asn Ser He Pro Val Phe Cys Pro Asp Phe Cys Cys Gly 

35 40 45 

Ser Cys Ser Asn Gin Tyr Cys Cys Ser Asp Val Leu Arg Lys He Gin 

50 55 60 

Trp Asn Glu Glu Met Cys Pro Glu Pro Glu Ser Ser Arg Phe Ser Thr 
65 70 75 80 

Pro Ala Glu Glu Thr Pro Glu His Leu Gly Ser Ala Leu Lys Phe Arg 

85 90 95 

Ser Ser Phe Asp Ser Asp Pro Met Ser Gly Phe Gly Ala Thr Val Ala 

100 105 110 

He Gly Val Thr He Phe Val Val Phe He Ala Thr He He He Cys 

115 120 125 

Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys Met Cys Cys Pro Gin Arg 

130 135 140 

Pro Val Val Thr Asn Thr Thr Thr Thr Thr Val Val His Ala Pro Tyr 
145 150 155 160 

Pro Gin Pro Gin Pro Gin Pro Val Ala Pro Ser Tyr Pro Gly Pro Thr 

165 170 175 

Tyr Gin Gly Tyr His Pro Met Pro Pro Pro Ala Arg Asn Ala Ser Ser 

180 185 190 

Thr Leu Pro Asn Val Val Pro Thr Thr Leu Pro Gly Pro Ala His Arg 

195 200 * 205 

Ala Ala Thr Leu Pro 
210 

<210> 165 
<211> 1858 
<212> DNA 

<213> Mus musculus 
<400> 165 

gtcgacccac gcgtccgcgc ggaggttgcg gcggcaccgt ggtcttgggc ttggtccgtc 60 

tgttcgtccg tccgttggtc tgtcccgcca tggctgcgcc ggcgccctct ctgtggaccc 120 

tattgctgct gctgttgctg ctgccgccgc ctccgggtgc ccatggtgag ctgtgcaggc 180 

cctttggtga agacaattcg atcccagtgt tctgtcctga tttctgttgt ggttcctgtt 240 

ccaaccaata ctgctgctcg gacgtgctga ggaaaatcca gtggaatgag gaaatgtgtc 300 

ctgagccaga gtccagcaga ttttccaccc ccgcggagga gacacccgaa catctgggtt 360 

cagcgctgaa atttcgatcc agttttgaca gtgaccctat gtcagggttc ggagcgaccg 420 

tcgccattgg cgtgaccatc tttgtggtgt ttattgccac tatcatcatc tgcttcacct 480 

gctcctgctg ctgtctgtat aagatgtgct gcccccaacg ccctgtcgtg accaacacca 540 

caactactac cgtggttcat gccccttacc ctcagcctca acctcaacct gtggccccca 600 

gctatcctgg accaacatac cagggctacc atcccatgcc ccccccagcc aggaatgcca 660 

gcagcaccct acccaacgta gtacccacca ccctacctgg cccagcccac agggccgcca 720 

ccctaccatg agtccttggc tggagccagc cagcctccat acaacccgac ctacatggat 780 

tccctaaaga caattccctg aacctgcccc cagcctcttt ggctgccatt tatgtcgtgt 840 

gtgagtgagt gatacgcaga gttctttact gctgtctgtg gtgtgtgtgc cttgtctaga 900 

catgtggctt cctctgctgt tgaccaggta ggcgcaagtc ttaccagtgt gggtcgggac 960 

caacctgttt tcttcctcac ttgaaattgt actttctgaa atttcaagca aattaaaaac 1020 

aataaggtag gaggtatttc ccacgtcacc ccaaggtgac cagccatggc ctgtcatact 1080 

taggagagca agctttttgc gggtacagag caggctttgg ggggtaacca gctagctgct 1140 
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gctaggcctt tattcccagg gtttggctgc attggcagtg aggcaggtgg ctgggggtga 1200 

caccaggtga caaggggact cagtggcagg gggtcacacc aggcagaaca ccatacactc 1260 

tccatcagct gtctgtctgg atgtcactgt ccttcccggg gctgtataga gggccacatg 1320 

tgttcactat tcaggctcca ctgggggaat tttcctacct ttgctggctt ggctcctgct 1380 

cccaggccag ggacctcggt ctgtctacta cacactctgg tttctccctg cactgtcttt 1440 

ttactgttag ccaaacattt tgcctgtttt ctgtctccag atgtgtgata attggtgtga 1500 

ggttgaaatc cctggttcct ggaggacaga caacctgacc tccgactgtc agtttccctt 1560 

gacaccatct tcatagaaat acctgactcc tgtaccacag tccagtttgt cccagtagca 1620 

gggacaccaa ggccaatggg ttatctggac caaaggtggg gtggagggcc tagatggtat 1680 

ctccggccca gatgtgaata cctccatatt ccctgttggt tcctgtttca ctggctgttt 1740 

tagctttgtg ttgattggtg tttctgagca ttcagactcc gcaccctcat ttctaataaa 1800 

tgcaacattg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg gcggccgc 1858 

<210> 166 
<211> 639 
<212> DNA 
<213> Mus musculus 

<400> 166 

atggctgcgc cggcgccctc tctgtggacc ctattgctgc tgctgttgct gctgccgccg 60 

cctccgggtg cccatggtga gctgtgcagg ccctttggtg aagacaattc gatcccagtg 120 

ttctgtcctg atttctgttg tggttcctgt tccaaccaat actgctgctc ggacgtgctg 180 

aggaaaatcc agtggaatga ggaaatgtgt cctgagccag agtccagcag attttccacc 240 

cccgcggagg agacacccga acatctgggt tcagcgctga aatttcgatc cagttttgac 3 00 

agtgacccta tgtcagggtt cggagcgacc gtcgccattg gcgtgaccat gtttgtggtg 360 

tttattgcca ctatcatcat ctgcttcacc tgctcctgct gctgtctgta taagatgtgc 420 

tgcccccaac gccctgtcgt gaccaacacc acaactacta ccgtggttca tgccccttac 480 

cctcagcctc aacctcaacc tgtggccccc agctatcctg gaccaacata ccagggctac 54 0 

catcccatgc cccccccagc caggaatgcc agcagcaccc tacccaacgc agtacccacc 600 

accctacctg gcccagccca cagggccgcc accctacca 63 9 

<210> 167 
<211> 639 
<212> DNA 
<213> Mus musculus 

<400> 167 

atggctgcgc cggcgccctc tctgtggacc ctattgctgc tgctgttgct gctgccgccg 60 

cctccgggtg cccatggtga gctgtgcagg ccctttggtg aagacaattc gatcccagtg 120 

ttctgtcctg atttctgttg tggttcctgt tccaaccaat actgctgctc ggacgtgctg 180 

aggaaaatcc agtggaatga ggaaatgtgt cctgagccag agtccagcag attttccacc 240 

cccgcggagg agacacccga acatctgggt tcagcgctga aatttcgatc cagttttgac 300 

agtgacccta tgtcagggtt cggagcgacc gtcgccattg gcgtgaccat ctttgtggtg 360 

. tttattgtca ctatcatcat ctgcttcacc tgctcctgct gctgtctgta taagatgtgc 420 

tgcccccaac gccctgtcgt gaccaacacc acaactacta ccgtggttca tgccccttac 480 

cctcagcctc aacctcaacc tgtggccccc agctatcctg gaccaacata ccagggctac 54 0 

catcccatgc cccccccagc caggaatgcc agcagcaccc tacccaacgc agtacccacc 600 

accctacctg gcccagccca cagggccgcc accctacca 639 

<210> 168 
<211> 639 
<212> DNA 
<213> Mus musculus 

<400> 168 

atggctgcgc cggcgccctc tctgtggacc ctattgctgc tgctgttgct gctgccgccg 60 

cctccgggtg cccatggtga gctgtgcagg ccctttggtg aagacaattc gatcccagtg 120 

ttctgtcctg atttctgttg tggttcctgt tccaaccaat actgctgctc ggacgtgctg 180 
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aggaaaatcc agtggaatga ggaaatgtgt cctgagccag agtccagcag attttccacc 24 0 

cccgcggagg agacacccga acatctgggt tcagcgctga aatttcgatc cagttttgac 300 

agtgacccta tgtcagggtt cggagcgacc gtcgccattg gcgtgaccat ctttgtggtg 360 

tttattgcca ctatcatcat ctgcttcacc tgctcctgct gctgtctgta taagatgtgc 420 

tgcccccaac gccctgtcgt gaccaacacc acaactacta ccgtggctca tgccccttac 4 80 

cctcagcctc aacctcaacc tgtggccccc agctatcctg gaccaacata ccagggctac 540 

catcccatgc cccccccagc caggaatgcc agcagcaccc tacccaacgc agtacccacc 600 

accctacctg gcccagccca cagggccgcc accctacca 63 9 

<210> 169 
<211> 639 
<212> DNA 

<213> Mus musculus 
<400> 169 

atggctgcgc cggcgccctc tctgtggacc ctattgctgc tgctgttgct gctgccgccg 60 

cctccgggtg cccatggtga gctgtgcagg ccctttggtg aagacaattc gatcccagtg 12 0 

ttctgtcctg atttctgttg tggttcctgt tccaaccaat actgctgctc ggacgtgctg 180 

aggaaaatcc agtggaatga ggaaatgtgt cctgagccag agtccagcag attttccacc 240 

cccgcggagg agacacccga acatctgggt tcagcgctga aatttcgatc cagttttgac 300 

agtgacccta tgtcagggtt cggagcgacc gtcgccattg gcgtgaccat ctttgtggtg 360 

tttattgcca ctatcatcat ctgcttcacc tgctcctgct gctgtctgta taagatgtgc 420 

tgcccccaac gccctgtcgt gaccaacacc acaactacta ccgtggttca tgccccttac 480 

cctcagcctc aacctcaacc tgtggccccc agctatcctg gaccaacata ccagggctac 540 

catcccatgc cccccccagc caggaatgcc agcagcaccc tacccaacgt agtacccacc 600 

accctacctg gcccagccca cagggccgcc accctacca 639 

<210> 170 
<211> 1218 
<212> DNA 

<213> Homo sapiens 
<400> 170 

atggggccca gcacccctct cctcatcttg ttccttttgt catggtcggg acccctccaa 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcttt agaggaacgg 120 

ctggcccagt gccaggacca gagtagtcgg catgctgctg agctgcggga cttcaagaac 180 

aagatgctgc cactgctgga ggtggcagag aaggagcggg aggcactcag aactgaggcc 24 0 

gacaccatct ccgggagagt ggatcgtctg gagcgggagg cagactatct ggagacccag 300 

aacccagctc tgccctgtgt agagtttgat gagaaggtga ctggaggccc tgggaccaaa 360 

ggcaagggaa gaaggaatga gaagtacgat atggtgacag actgtggcta cacaatctct 420 

caagtgagat caatgaagat tctgaagcga tttggtggcc cagctggtct atggaccaag 480 

gatccactgg ggcaaacaga gaagatctac gtgttagatg ggacacagaa tgacacagcc 540 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

gtccgggtgc ccttcccctg ggtaggcaca gggcagctgg tatatggtgg ctttctttat 660 

tttgctcgga ggcctcctgg aagacctggt ggaggtggtg agatggagaa cactttgcag 720 

ctaatcaaat tccacctggc aaaccgaaca gtggtggaca gctcagtatt cccagcagag 780 

gggctgatcc ccccctacgg cttgacagca gacacctaca tcgacctggc agctgatgag 840 

gaaggtcttt gggctgtcta tgccacccgg gaggatgaca ggcacttgtg tctggccaag 900 

ttagatccac agacactgga cacagagcag cagtgggaca caccatgtcc cagagagaat 960 

gctgaggctg cctttgtcat ctgtgggacc ctctatgtcg tctataacac ccgtcctgcc 1020 

agtcgggccc gcatccagtg ctcctttgat gccagcggca ccctgacccc tgaacgggca 1080 

gcactccctt attttccccg cagatatggt gcccatgcca gcctccgcta taacccccga 1140 

gaacgccagc tctatgcctg ggatgatggc taccagattg tctataagct ggagatgagg 1200 

aagaaagagg aggaggtt . 12 18 

<210> 171 
<211> 1218 
<212> DNA 
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<213> Homo sapiens 



<400> 171 

atggggccca gcacccctct cctcatcttg ttccttttgt catggtcggg acccctccaa 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcttt agaggaacgg 120 

ctggcccagt gccaggacca gagtagtcgg catgctgctg agctgcggga cttcaagaac 180 

aagatgctgc cactgctgga ggtggcagag aaggagcggg aggcactcag aactgaggcc 24 0 

gacaccatct ccgggagagt ggatcgtctg gagcgggagg tagactatct ggagacccag 300 

aacccagctc tgccctgtgt agagtttgat gagaaggtga ctggaggccc tgggaccaaa 360 

ggcaagggaa gaaggaatga gaagtacgat atggtgacag actgtggcta cacaatctct 42 0 

caagtgagat caatgaagat tctgaagcga tttggtggcc cagctggtct atggaccaag 4 80 

gatccactgg ggcaaacaga gaagatctac gtgttagatg ggacacagaa tgacacagcc 540 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

gtccgggtgc ccttcccctg ggtaggcaca gggcagctgg tatatggtgg ctttctttat 660 

tttgctcgga ggcctcctgg aagacctggt ggaggtggtg agatggagaa cactttgcag 720 

ctaatcaaat tccacctggc aaaccgaaca gtggtggaca gctcagtatt cccagcagag 780 

gggctgatcc ccccctacgg cttgacagca gacacctaca tcgacctggc agctgatgag 840 

gaaggtcttt gggctgtcta tgccacccgg gaggatgaca ggcacttgtg tctggccaag 900 

ttagatccac agacactgga cacagagcag cagtgggaca caccatgtcc cagagagaat 960 

gctgaggctg cctttgtcat ctgtgggacc ctctatgtcg tctataacac ccgtcctgcc 1020 

agtcgggccc gcatccagtg ctcctttgat gccagcggca ccctgacccc tgaacgggca 1080 

gcactccctt attttccccg cagatatggt gcccatgcca gcctccgcta taacccccga 1140 

gaacgccagc tctatgcctg ggatgatggc taccagattg tctataagct ggagatgagg 1200 

aagaaagagg aggaggtt " 1218 

<210> 172 

<211> 1219 

<212> DNA 

<213> Homo sapiens 



<400> 172 

catggggccc agcacccctc tcctcatctt gttccttttg tcatggtcgg gacccctcca 60 

aggacagcag caccaccttg tggagtacat ggaacgccga ctagctgctt tagaggaacg 120 

gctggcccag tgccaggacc agagtagtcg gcatgctgct gagctgcggg acttcaagaa 180 

caagatgctg ccactgctgg aggtggcaga gaaggagcgg gaggcactca gaactgaggc 240 

cgacaccatc tccgggagag tggatcgtct ggagcgggag gtagactatc tggagaccca 300 

gaacccagct ctgccctgtg tagagtttga tgagaaggtg actggaggcc ctgggaccaa 360 

aggcaaggga agaaggaatg agaagtacga tatggtgaca gactgtggct acacaatctc 420 

tcaagtgaga tcaatgaaga ttctgaagcg atttggtggc ccagctggta tatggaccaa 480 

• ggatccactg gggcaaacag agaagatcta cgtgttagat gggacacaga atgacacagc 540 

ctttgtcttc ccaaggctgc gtgacttcac ccttgccatg gctgcccgga aagcttcccg 600 

agtccgggtg cccttcccct gggtaggcac agggcagctg gtatatggtg gctttcttta 660 

ttttgctcgg aggcctcctg gaagacctgg tggaggtggt gagatggaga acactttgca 720 

gctaatcaaa ttccacctgg caaaccgaac agtggtggac agctcagtat tcccagcaga -780 

ggggctgatc cccccctacg gcttgacagc agacacctac atcgacctgg cagctgatga 840 

ggaaggtctt tgggctgtct atgccacccg ggaggatgac aggcacttgt gtctggccaa 900 

gttagatcca cagacactgg acacagagca gcagtgggac acaccatgtc ccagagagaa 960 

tgctgaggct gcctttgtca tctgtgggac cctctatgtc gtctataaca cccgtcctgc 1020 

cagtcgggcc cgcatccagt gctcctttga tgccagcggc accctgaccc ctgaacgggc 1080 

agcactccct tattttcccc gcagatatgg tgcccatgcc agcctccgct ataacccccg 1140 

agaacgccag ctctatgcct gggatgatgg ctaccagatt gtctataagc tggagatgag 1200 

gaagaaagag gaggaggtt 1219 

<210> 173 

<211> 1218 

<212> DNA 

<213> Homo sapiens 
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<400> 173 

atggggccca gcacccctct cctcatcttg ttccttttgt catggtcggg acccctccaa 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcttt agaggaacgg 120 

ctggcccagt gccaggacca gagtagtcgg catgctgctg agctgcggga cttcaagaac 180 

aagatgctgc cactgctgga ggtggcagag aaggagcggg aggcactcag aactgaggcc 240 

gacaccatct ccgggagagt ggatcgtctg gagcgggagg tagactatct ggagacccag 300 

aacccagctc tgccctgtgt agagtttgat gagaaggtga ctggaggccc tgggaccaaa 360 

ggcaagggaa gaaggaatga gaagtacgat atggtgacag actgtggcta cacaatctct 420 

caagtgagat caatgaagat tctgaagcga tttggtggcc cagctggtct atggaccaag 480 

gatccactgg ggcaaacaga gaagatctac gtgttagatg ggacacagaa tgacacagtc 540 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

gtccgggtgc ccttcccctg ggtaggcaca gggcagctgg tatatggtgg ctttctttat 660 

tttgctcgga ggcctcctgg aagacctggt ggaggtggtg agatggagaa cactttgcag 720 

ctaatcaaat tccacctggc aaaccgaaca gtggtggaca gctcagtatt cccagcagag 780 

gggctgatcc ccccctacgg cttgacagca gacacctaca tcgacctggc agctgatgag 840 

gaaggtcttt gggctgtcta tgccacccgg gaggatgaca ggcacttgtg tctggccaag 900 

ttagatccac agacactgga cacagagcag cagtgggaca caccatgtcc cagagagaat 960 

gctgaggctg cctttgtcat ctgtgggacc ctctatgtcg tctataacac ccgtcctgcc 1020 

agtcgggccc gcatccagtg ctcctttgat gccagcggca ccctgacccc tgaacgggca 1080 

gcactccctt attttccccg cagatatggt gcccatgcca gcctccgcta taacccccga 1140 

gaacgccagc tctatgcctg ggatgatggc taccagattg tctataagct ggagatgagg 1200 

aagaaagagg aggaggtt ^ " " " 121 8 

<210> 174 
<211> 729 
<212> DNA 

<213> Mus musculus 
<400> 174 

atgaggccac ttcttgccct tctgcttctg ggtctggtgt caggctctcc tcctctggac 60 

gacaacaaga tccccagcct gtgtcccggg cagcccggcc ttccaggcac accaggtcac 120 

catggcagcc aaggcctgcc tggccgtgac ggccgtgatg gccgcgacgg tgcacccgga 180 

gctccgggag agaaaggcga gggcgggaga ccgggactac ctggcccacg tggggagccc 240 

gggccgcgtg gagaggtagg gcccatgggg gctatcgggc ctgcggggga gtgctcggta 300 

cccccacgat cagccttcag tgccaagcga tccgagagcc gggtacctcc gccagccgac 360 

acacccctac ctttcgaccg tgtgctgcta aatgagcagg gccattacga ccccactact 420 

ggcaagttca cctgccaagt gcctggcgtc tactactttg ctgtgcacgc cactgtctac 480 

cgggccagct tgcagtttga tcttgtcaaa aacgggcagt ccatcgcctc tttcttccag 54 0 

tattttgggg ggtggcccaa gccagcctcg ctctcagggg gtgcgatggt aaggctagaa 600 

cctgaggacc aggtgtgggt gcaggtgggc gtgggtgatt acattggcat ctatgccagc 660 

atcaagacag acagtacctt ctctggattt ctcgtctatt ctgactggca cagctcccca 720 
gtcttcgct ~ 

<210> 175 
<211> 729 
<212> DNA 

<213> Mus musculus 
<400> 175 

atgaggccac ttcttgccct tctgcttctg ggtctggtgt caggctctcc tcctctggac 60 

gacaacaaga tccccagcct gtgtcccggg cagcccggcc ttccaggcac accaggtcac 120 

catggcagcc aaggcctgcc tggccgtgac ggccgtgatg gccgcgacgg tgcacccgga 180 

gctccgggag agaaaggcga gggcgggaga ccgggactac ctggcccacg tggggagccc 240 

gggccgcgtg gagaggcagg gcccatgggg gctatcgggc ctgcggggga gtgctcggta 300 

cccccacgat cagtcttcag tgccaagcga tccgagagcc gggtacctcc gccagccgac 360 

acacccctac ctttcgaccg tgtgctgcta aatgagcagg gccattacga ccccactact 420 

ggcaagttca cctgccaagt gcctggcgtc tactactttg ctgtgcacgc cactgtctac 480 

cgggccagct tgcagtttga tcttgtcaaa aacgggcagt ccatcgcctc tttcttccag 540 
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tattttgggg ggtggcccaa gccagcctcg ctctcagggg gtgcgatggt aaggctagaa 600 

cctgaggacc aggtgtgggt gcaggtgggc gtgggtgatt acattggcat ctatgccagc 660 

atcaagacag acagtacctt ctctggattt ctcgtctatt ctgactggca cagctcccca 72 0 

gtcttcgct 729 

<210> 176 
<211> 729 
<212> DNA 
<213> Mus musculus 

<400> 176 

atgaggccac ttcttgccct tctgcttctg ggtctggtgt caggctctcc tcctctggac 60 

gacaacaaga tccccagcct gtgtcccggg cagcccggcc ttccaggcac accaggtcac 120 

catggcagcc aaggcctgcc tggccgtgac ggccgtgatg gccgcgacgg tgcacccgga 180 

gctccgggag agaaaggcga gggcgggaga ccgggactac ctggcccacg tggggagccc 240 

gggccgcgtg gagaggcagg gcccatgggg gctatcgggc ctgcggggga gtgctcggta 300 

cccccacgat cagccttcag tgccaagcga tccgagagcc gggtacctcc gccagccgac 360 

acacccctac ctttcgaccg tgcgctgcta aatgagcagg gccattacga ccccactact 420 

ggcaagttca cctgccaagt gcctggcgtc tactactttg ctgtgcacgc cactgtctac 4 80 

cgggccagct tgcagtttga tcttgtcaaa aacgggcagt ccatcgcctc tttcttccag 540 

tattttgggg ggtggcccaa gccagcctcg ctctcagggg gtgcgatggt aaggctagaa 600 

cctgaggacc aggtgtgggt gcaggtgggc gtgggtgatt acattggcat ctatgccagc 660 

atcaagacag acagtacctt ctctggattt ctcgtctatt ctgactggca cagctcccca 720 

gtcttcgct 729 

<210> 177 
<211> 729 
<212> DNA 
<213> Mus musculus 

<400> 177 

atgaggccac ttcttgccct tctgcttctg ggtctggtgt caggctctcc tcctctggac 60 

gacaacaaga tccccagcct gtgtcccggg cagcccggcc ttccaggcac accaggtcac 120 

catggcagcc aaggcctgcc tggccgtgac ggccgtgatg gccgcgacgg tgcacccgga 180 

gctccgggag agaaaggcga gggcgggaga ccgggactac ctggcccacg tggggagccc 240 

gggccgcgtg gagaggcagg gcccatgggg gctatcgggc ctgcggggga gtgctcggta 300 

cccccacgat cagccttcag tgccaagcga tccgagagcc gggtacctcc gccagccgac 360 

acacccctac ctttcgaccg tgtgctgcta aatgagcagg gccattacga ccccactact 420 

ggcaagttca cctgccaagt gcctggcgtc tactactttg ctgtgcacgc cactgtctac 480 

cgggccagct tgcagtttga tattgtcaaa aacgggcagt ccatcgcctc tttcttccag 540 

tattttgggg ggtggcccaa gccagcctcg ctctcagggg gtgcgatggt aaggctagaa 600 

cctgaggacc aggtgtgggt gcaggtgggc gtgggtgatt acattggcat ctatgccagc 660 

atcaagacag acagtacctt ctctggattt ctcgtctatt ctgactggca cagctcccca 720 

gtcttcgct 729 

<210> 178 
<2il> 1218 
<212> DNA 

<213> Mus musculus 
<400> 178 

atggggccca gtgctcctct gctgctcctc ttctttttgt catggacggg accccttcag 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcctt agaggaacgg 120 

ctggcccaat gccaggatca gagtagtcgg catgctgccg agcttcggga cttcaaaaac 180 

aagatgttgc ctctcctgga ggtggcagag aaggagcggg agaccctcag aactgaagca 240 

gactccatct caggaagagt ggaccgtatt gaaagggagg tagactatct ggagacacag 300 

aacccagctt tgccctgtgt agagctggat gagaaggtga ctggaggtcc tggagccaaa 360 

ggcaagggcc gaagaaatga gaaatacgat atggtgacgg actgtagcta cacagtcgct 4 20 
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caggtgaggt caatgaagat cctgaagcgg tttggtggtt cagttggcct atggaccaag 480 

gatccgctgg ggccagcaga gaagatctac gtgttagacg gcacccagaa cgacacggct 540 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

attcgggtgc ccttcccctg ggtaggcacg gggcagctgg tgtacggtgg cttcctttat 660 

tatgctcgaa ggcctcctgg aggacctgga gggggtggtg aattggagaa cactctgcag 720 

ctgatcaaat ttcacttggc aaaccgaaca gtggtggata gctcagtgtt ccctgcagag 780 

agcctgatac ccccctacgg cctgacagca gatacatata tcgacctggc agctgatgag 840 

gagggcctgt gggctgtcta tgccactcga gatgatgaca ggcatttgtg tctagccaag 900 

ttagacccac agacacttga cacagagcag cagtgggaca caccatgtcc cagagagaac 960 

gcagaggctg cgtttgtcat ctgtgggacc ctgtacgttg tctataacac ccgccctgcc 1020 

agtagggctc gtattcagtg ttccttcgat gccagtggta ctctcgcccc tgaaagggca 1080 

gcactctcct attttccacg ccgatatggt gcccatgcca gccttcgcta taacccccgt 1140 

gagcgccagc tgtatgcctg ggatgatggc taccagattg tctacaaatt ggagatgaag 1200 

aagaaggagg aggaagtt " ~ 12 is 

<210> 179 

<211> 1218 

<212> DNA 

<213> Mus musculus 

<400> 179 

atggggccca gtgctcctct gctgctcctc ttctttttgt catggacggg accccttcag 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcctt agaggaacgg 120 

ctggcccaat gccaggatca gagtagtcgg catgctgccg agcttcggga cttcaaaaac 180 

aagatgttgc ctctcctgga ggtggcagag aaggagcggg agaccctcag aactgaagca 240 

gactccatct caggaagagt ggaccgtctt gaaagggagg tagactatct ggagacacag 300 

aacccagctt tgccctgtgt agagctggat gagaaggtga ctggaggtcc tggagccaaa 360 

ggcaagggcc gaagaaatga gaaatacgat atagtgacgg actgtagcta cacagtcgct 420 

caggtgaggt caatgaagat cctgaagcgg tttggtggtt cagttggcct atggaccaag 480 

gatccgctgg ggccagcaga gaagatctac gtgttagacg gcacccagaa cgacacggct 540 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

attcgggtgc ccttcccctg ggtaggcacg gggcagctgg tgtacggtgg cttcctttat 660 

tatgctcgaa ggcctcctgg aggacctgga gggggtggtg aattggagaa cactctgcag 720 

ctgatcaaat ttcacttggc aaaccgaaca gtggtggata gctcagtgtt ccctgcagag 780 

agcctgatac ccccctacgg cctgacagca gatacatata tcgacctggc agctgatgag 840 

gagggcctgt gggctgtcta tgccactcga gatgatgaca ggcatttgtg tctagccaag 900 

ttagacccac agacacttga cacagagcag cagtgggaca caccatgtcc cagagagaac 960 

gcagaggctg cgtttgtcat ctgtgggacc ctgtacgttg tctataacac ccgccctgcc 1020 

agtagggctc gtattcagtg ttccttcgat gccagtggta ctctcgcccc tgaaagggca 1080 

gcactctcct attttccacg ccgatatggt gcccatgcca gccttcgcta taacccccgt 1140 

gagcgccagc tgtatgcctg ggatgatggc taccagattg tctacaaatt ggagatgaag 1200 

aagaaggagg aggaagtt 1218 

<210> 180 

<211> 1218 

<212> DNA 

<213> Mus musculus 

<400> 180 

atggggccca gtgctcctct gctgctcctc ttctttttgt catggacggg accccttcag 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcctt agaggaacgg 120 

ctggcccaat gccaggatca gagtagtcgg catgctgccg agcttcggga cttcaaaaac 180 

aagatgttgc ctctcctgga ggtggcagag aaggagcggg agaccctcag aactgaagca 240 

gactccatct caggaagagt ggaccgtctt gaaagggagg tagactatct ggagacacag 300 

aacccagctt tgccctgtgt agagctggat gagaaggtga ctggaggtcc tggagccaaa 360 

ggcaagggcc gaagaaatga gaaatacgat atggtgacgg actgtagcta cacagtcgct 420 

caggtgaggt caatgaagat cctgaagcgg tttggtggtt cagttggcct atggaccaag 480 

gatccgctgg ggccagcaga gaagatctac gcgttagacg gcacccagaa cgacacggct 540 
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tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

attcgggtgc ccttcccctg ggtaggcacg gggcagctgg tgtacggtgg cttcctttat 660 

tatgctcgaa ggcctcctgg aggacctgga gggggtggtg aattggagaa cactctgcag 720 

ctgatcaaat ttcacttggc aaaccgaaca gtggtggata gctcagtgtt ccctgcagag 780 

agcctgatac ccccctacgg cctgacagca gatacatata tcgacctggc agctgatgag 840 

gagggcctgt.gggctgtcta tgccactcga gatgatgaca ggcatttgtg tctagccaag 900 

ttagacccac agacacttga cacagagcag cagtgggaca caccatgtcc cagagagaac 960 

gcagaggctg cgtttgtcat ctgtgggacc ctgtacgttg tctataacac ccgccctgcc 1020 

agtagggctc gtattcagtg ttccttcgat gccagtggta ctctcgcccc tgaaagggca 1080 

gcactctcct attttccacg ccgatatggt gcccatgcca gccttcgcta taacccccgt 1140 

gagcgccagc tgtatgcctg ggatgatggc taccagattg tctacaaatt ggagatgaag 1200 

aagaaggagg aggaagtt ~' 12 is 

<210> 181 

<211> 1218 

<212> DNA 

<213> Mus musculus 



<400> 181 

atggggccca gtgctcctct gctgctcctc ttctttttgt catggacggg accccttcag 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcctt agaggaacgg 120 

ctggcccaat gccaggatca gagtagtcgg catgctgccg agcttcggga cttcaaaaac 180 

aagatgttgc ctctcctgga ggtggcagag aaggagcggg agaccctcag aactgaagca 240 

gactccatct caggaagagt ggaccgtctt gaaagggagg tagactatct ggagacacag 3 00 

aacccagctt tgccctgtgt agagctggat gagaaggtga ctggaggtcc tggagccaaa 3 60 

ggcaagggcc gaagaaatga gaaatacgat atggtgacgg actgtagcta cacagtcgct 420 

caggtgaggt caatgaagat cctgaagcgg tttggtggtt cagttggcct atggaccaag 4 80 

gatccgctgg ggccagcaga gaagatctac gtgttagacg gcacccagaa cgacacggct 540 

tttgtcttcc caaggctgcg tgacttcacc cttgtcatgg ctgcccggaa agcttcccga 600 

attcgggtgc ccttcccctg ggtaggcacg gggcagctgg tgtacggtgg cttcctttat 660 

tatgctcgaa ggcctcctgg aggacctgga gggggtggtg aattggagaa cactctgcag 720 

ctgatcaaat ttcacttggc aaaccgaaca gtggtggata gctcagtgtt ccctgcagag 780 

agcctgatac ccccctacgg cctgacagca gatacatata tcgacctggc agctgatgag 840 

gagggcctgt gggctgtcta tgccactcga gatgatgaca ggcatttgtg tctagccaag 900 

ttagacccac agacacttga cacagagcag cagtgggaca caccatgtcc cagagagaac 960 

gcagaggctg cgtttgtcat ctgtgggacc ctgtacgttg tctataacac ccgccctgcc 1020 

agtagggctc gtattcagtg ttccttcgat gccagtggta ctctcgcccc tgaaagggca 1080 

gcactctcct attttccacg ccgatatggt gcccatgcca gccttcgcta taacccccgt 1140 

gagcgccagc tgtatgcctg ggatgatggc taccagattg tctacaaatt ggagatgaag 1200 

aagaaggagg aggaagtt 1218 

<210> 182 

<211> 1110 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modif ied_base 
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 182 

atgatttccc tcccggggcc cctggtgacc aacttgntgc ggtttttgtt cctggggctg 60 

agtgccctcg cgcccccctc gcgggcccag ctgcaactgc acttgcccgc caaccggttg 120 

caggcggtgg aggaggggga aagtggtgct tcagcatggt acaccttgca cagggaggcg 180 

tcttcatccc agccatggga ggtgcccttt gtgatgtggt tcttcaaaca gaaagaaaag 24 0 

gaggatcagg tgttgtccta catcaatggg gtcacaacaa gcaaacctgg agtatccttg 300 

gtctactcca tgccctcccg gaacctgtcc ctgcgggtgg agggtctcca ggagaaagac 360 
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tctggcccct acagctgctc cgtgaatgtg caagacaaac aaggcaaatc taggggccac 420 

agcatcaaaa ccttagaact caatgtactg gttcctccag ctcctccatc ctgccgtctc 4 80 

cagggtgtgc cccatgtggg ggcaaacgtg accctgagct gccagtctcc aaggagtaag 540 

cccgctgtcc aataccagtg ggatcggcag cttccatcct tccagacttt ctttgcacca 600 

gcattagatg tcatccgtgg gtctttaagc ctcaccaacc tttcgtcttc catggctgga 660 

gtctatgtct gcaaggccca caatgaggtg ggcactgccc aatgtaatgt gacgctggaa 720 

gtgagcacag ggcctggagc tgcagtggtt gctgaagctg ttgtgggtac cctggttgga 780 

ctggggttgc tggctgggct ggtcctcttg taccaccgcc ggggcaaggc cctggaggag 840 

ccagccaatg atatcaagga ggatgccatt gctccccgga ccctgccctg gcccaagagc 900 

tcagacacaa tctccaagaa tgggaccctt tcctctgtca cctccgcacg agccctccgg 960 

ccaccccatg gccctcccag gcctggtgca ttgaccccca cgcccagtct atccagccag 1020 

gccctgccct caccaagaca tgcccacgac agatggggcc caccctcaac caatatcccc 1080 

catccctggt ggggtttttt cctttggctt 1110 

<210> 183 

<211> 1110 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modif iedjbase 
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 183 

atgatttccc tcccggggcc cctggtgacc aacttgntgc ggtttttgtt cctggggctg 60 

agtgccctcg cgcccccctc gcgggcccag ctgcaactgc acttgcccgc caaccggttg 120 

caggcggtgg aggaggggga aagtggtgct tcagcatggt acaccttgca cagggaggtg 180 

tcttcatccc agccatggga ggtgcccttt gtgatgtggt tcttcaaaca gaaagaaaag 240 

gaggatcagg tgttgtccta catcaatggg gtcacaacaa gcaaacctgg agtatccttg 300 

gcctactcca tgccctcccg gaacctgtcc ctgcgggtgg agggtctcca ggagaaagac 360 

tctggcccct acagctgctc cgtgaatgtg caagacaaac aaggcaaatc taggggccac 420 

agcatcaaaa ccttagaact caatgtactg gttcctccag ctcctccatc ctgccgtctc 480 

cagggtgtgc cccatgtggg ggcaaacgtg accctgagct gccagtctcc aaggagtaag 540 

cccgctgtcc aataccagtg ggatcggcag cttccatcct tccagacttt ctttgcacca 600 

gcattagatg tcatccgtgg gtctttaagc ctcaccaacc tttcgtcttc catggctgga 660 

gtctatgtct gcaaggccca caatgaggtg ggcactgccc aatgtaatgt gacgctggaa 720 

gtgagcacag ggcctggagc tgcagtggtt gctgaagctg ttgtgggtac cctggttgga 780 

ctggggttgc tggctgggct ggtcctcttg taccaccgcc ggggcaaggc cctggaggag 840 

ccagccaatg atatcaagga ggatgccatt gctccccgga ccctgccctg gcccaagagc 900 

tcagacacaa tctccaagaa tgggaccctt tcctctgtca cctccgcacg agccctccgg 960 

ccaccccatg gccctcccag gcctggtgca ttgaccccca cgcccagtct atccagccag 1020 

gccctgccct caccaagaca tgcccacgac agatggggcc caccctcaac caatatcccc 1080 

catccctggt ggggtttttt cctttggctt 1110 

<210> 184 

<21l> 1110 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modif ied_base 
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 184 

atgatttccc tcccggggcc cctggtgacc aacttgntgc ggtttttgtt cctggggctg 60 

agtgccctcg cgcccccctc gcgggcccag ctgcaactgc acttgcccgc caaccggttg 120 
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caggcggtgg aggaggggga aagtggtgct tcagcatggt acaccttgca cagggaggtg 180 

tcttcatccc agccatggga ggtgcccttt gtgatgtggt tcttcaaaca gaaagaaaag 240 

gaggatcagg tgttgtccta catcaatggg gtcacaacaa gcaaacctgg agtatccttg 300 

gtctactcca tgccctcccg gaacctgtcc ctgcgggtgg agggtctcca ggagaaagac 360 

tctggcccct acagctgctc cgtgaatgtg caagacaaac aaggcaaatc taggggccac 420 

agcatcaaaa ccttagaact caatgtactg gttcctccag ctcctccatc ctgccgtatc 480 

cagggtgtgc cccatgtggg ggcaaacgtg accctgagct gccagtctcc aaggagtaag 540 

cccgctgtcc aataccagtg ggatcggcag cttccatcct tccagacttt ctttgcacca 600 

gcattagatg tcatccgtgg gtctttaagc ctcaccaacc tttcgtcttc catggctgga 660 

gtctatgtct gcaaggccca caatgaggtg ggcactgccc aatgtaatgt gacgctggaa 720 

gtgagcacag ggcctggagc tgcagtggtt gctgaagctg ttgtgggtac cctggttgga 780 

ctggggttgc tggctgggct ggtcctcttg taccaccgcc ggggcaaggc cctggaggag 84 0 

ccagccaatg atatcaagga ggatgccatt gctccccgga ccctgccctg gcccaagagc 900 

tcagacacaa tctccaagaa tgggaccctt tcctctgtca cctccgcacg agccctccgg 960 

ccaccccatg gccctcccag gcctggtgca ttgaccccca cgcccagtct atccagccag 1020 

gccctgccct caccaagaca tgcccacgac agatggggcc caccctcaac caatatcccc 1080 

catccctggt ggggtttttt cctttggctt 1110 

<210> 185 

<211> 1110 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modif ied_base 
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 185 

atgatttccc tcccggggcc cctggtgacc aacttgntgc ggtttttgtt cctggggctg 60 

agtgccctcg cgcccccctc gcgggcccag ctgcaactgc acttgcccgc caaccggttg 120 

caggcggtgg aggaggggga aagtggtgct tcagcatggt acaccttgca cagggaggtg 180 

tcttcatccc agccatggga ggtgcccttt gtgatgtggt tcttcaaaca gaaagaaaag 240 

gaggatcagg tgttgtccta catcaatggg gtcacaacaa gcaaacctgg agtatccttg 300 

gtctactcca tgccctcccg gaacctgtcc ctgcgggtgg agggtctcca ggagaaagac 360 

tctggcccct acagctgctc cgtgaatgtg caagacaaac aaggcaaatc taggggccac 420 

agcatcaaaa ccttagaact caatgtactg gttcctccag ctcctccatc ctgccgtctc 480 

cagggtgtgc cccatgtggg ggcaaacgtg accctgagct gccagtctcc aaggagtaag 540 

cccgttgtcc aataccagtg ggatcggcag cttccatcct tccagacttt ctttgcacca 600 

gcattagatg tcatccgtgg gtctttaagc ctcaccaacc tttcgtcttc catggctgga 660 

gtctatgtct gcaaggccca caatgaggtg ggcactgccc aatgtaatgt gacgctggaa 720 

gtgagcacag ggcctggagc tgcagtggtt gctgaagctg ttgtgggtac cctggttgga 780 

ctggggttgc tggctgggct ggtcctcttg taccaccgcc ggggcaaggc cctggaggag 840 

ccagccaatg atatcaagga ggatgccatt gctccccgga ccctgccctg gcccaagagc 900 

tcagacacaa tctccaagaa tgggaccctt tcctctgtca cctccgcacg agccctccgg 960 

ccaccccatg gccctcccag gcctggtgca ttgaccccca cgcccagtct atccagccag 1020 

gccctgccct caccaagaca tgcccacgac agatggggcc caccctcaac caatatcccc 1080 

catccctggt ggggtttttt cctttggctt 1110 

<210> 186 

<211> 1182 

<212> DNA 

<213> Mus musculus 

<400> 186 

atgattcttc aggctggaac ccccgagacc agcttgctgc gggttttgtt cctgggactg 60 

agtacccttg ctgccttctc ccgagctcag atggagttgc acgtgccccc gggcctcaac 120 

aaattggaag cggtagaggg agaagaagtg gtgctccccg cctggtacac gatggcacgg 180 
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gaggagtcgt ggtcccaccc ccgggaggtg cccatcatga tctggttctt ggaacaagaa 240 

gggaaggaac caaaccaggt gttgtcttac attaatggag tcatgacaaa taaacctgga 300 

acagccctgg tccactctat ctcttcacgg aatgtgtccc tgcgcctggg ggcactccag 360 

gagggagact ctgggactta ccgctgttct gtcaatgtgc agaatgatga aggcaaaagt 420 

ataggccaca gcatcaaaag catagagctc aaagtgctgg ttcctccagc tcctccatcc 480 

tgtagtttac agggtgtacc ctatgtcggg accaatgtga ccctgaactg caagtcccca 540 

aggagtaaac ctactgctca gtaccagtgg gagaggctgg ccccatcctc ccaggtcttc 600 

tttggaccag ccttagatgc tgttcgtgga tctttaaagc tcactaacct ttccattgcc 660 

atgtctggag tctatgtctg caaggctcaa aacagagtgg gctttgccaa gtgcaacgtg 720 

accttggacg tgatgacagg gtccaaggct gcagtggtcg ctggagcagt tgtgggcact 780 

tttgttgggt tggtgctgat agctgggctg gtcctgttgt accagcgccg gagcaagacc 840 

ttggaagagc tggccaatga tatcaaggaa gatgccattg ctccccggac cttgccttgg 900 

accaaaggct cagacacaat ctccaagaat gggacacttt cttcggtcac ctcagcacga 960 

gctctgcggc cacccaaggc tgctcctcca agacctggca catttactcc cacacccagt 1020 

gtctctagcc aggccctgtc ctcaccaaga ctgcccaggg tagatgaacc cccacctcag 1080 

gcagtgtccc tgaccccagg tggggtttct tcttctgctc tgagccgcat gggtgctgtg 1140 

cctgtgatgg tgcctgcaca gagtcaggct gggtctcttg tg 1182 

<210> 187 
<211> 1182 
<212> DNA 

<213> Mus musculus 
<400> 187 

atgattcttc aggctggaac ccccgagacc agcttgctgc gggttttgtt cctgggactg 60 

agtacccttg ctgccttctc ccgagctcag atggagttgc acgtgccccc gggcctcaac 120 

aaattggaag cggtagaggg agaagaagtg gtgctccccg cctggtacac gatggcacgg 180 

gaggagtcgt ggtcccaccc ccgggaggtg cccatcctga tctggttctt ggaacaagaa 240 

gggaaggaac caaaccaggt gttgtcttac attaatggag tcatgacaaa taaacctgga 300 

acagccctgg tccactctat ctcttcacgg aatgtgtccc tgcgcctggg ggcactccag 360 

gagggagact ctgggactta ccgctgttct gtcaatgtgc agaatgatga aggcaaaagt 420 

ataggccaca gcatcaaaag catagagctc aaagcgctgg ttcctccagc tcctccatcc 4 80 

tgtagtttac agggtgtacc ctatgtcggg accaatgtga ccctgaactg caagtcccca 540 

aggagtaaac ctactgctca gtaccagtgg gagaggctgg ccccatcctc ccaggtcttc 600 

tttggaccag ccttagatgc tgttcgtgga tctttaaagc tcactaacct ttccattgcc 660 

atgtctggag tctatgtctg caaggctcaa aacagagtgg gctttgccaa gtgcaacgtg 720 

accttggacg tgatgacagg gtccaaggct gcagtggtcg ctggagcagt tgtgggcact 780 

tttgttgggt tggtgctgat agctgggctg gtcctgttgt accagcgccg gagcaagacc 840 

ttggaagagc tggccaatga tatcaaggaa gatgccattg ctccccggac cttgccttgg 900 

accaaaggct cagacacaat ctccaagaat gggacacttt cttcggtcac ctcagcacga 960 

gctctgcggc cacccaaggc tgctcctcca agacctggca catttactcc cacacccagt 1020 

gtctctagcc aggccctgtc ctcaccaaga ctgcccaggg tagatgaacc cccacctcag 1080 

gcagtgtccc tgaccccagg tggggtttct tcttctgctc tgagccgcat gggtgctgtg 1140 

cctgtgatgg tgcctgcaca gagtcaggct gggtctcttg tg 1182 

<210> 188 
<211> 1182 
<212> DNA 

<213> Mus musculus 
<400> 188 

atgattcttc aggctggaac ccccgagacc agcttgctgc gggttttgtt cctgggactg 60 

agtacccttg ctgccttctc ccgagctcag atggagttgc acgtgccccc gggcctcaac 120 

aaattggaag cggtagaggg agaagaagtg gtgctccccg cctggtacac gatggcacgg 180 

gaggagtcgt ggtcccaccc ccgggaggtg cccatcctga tctggttctt ggaacaagaa 24 0 

gggaaggaac caaaccaggt gttgtcttac attaatggag tcatgacaaa taaacctgga 300 

acagccctgg tccactctat ctcttcacgg aatgtgtccc tgcgcctggg ggcactccag 360 

gagggagact ctgggactta ccgctgttct gtcaatgtgc agaatgatga aggcaaaagt 420 
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ataggccaca gcatcaaaag catagagctc aaagtgctgg ttcctccagc tcctccatcc 4 80 

tgtagtttac agggtgtacc ctatgtcggg accaatgtga ccctgaactg caagtcccca 540 

aggagtaaac ctactgctca gtaccagtgg gagaggctgg tcccatcctc ccaggtcttc 600 

tttggaccag ccttagatgc tgttcgtgga tctttaaagc tcactaacct ttccattgcc 660 

atgtctggag tctatgtctg caaggctcaa aacagagtgg gctttgccaa gtgcaacgtg 720 

accttggacg tgatgacagg gtccaaggct gcagtggtcg ctggagcagt tgtgggcact 780 

tttgttgggt tggtgctgat agctgggctg gtcctgttgt accagcgccg gagcaagacc 84 0 

ttggaagagc tggccaatga tatcaaggaa gatgccattg ctccccggac cttgccttgg 900 

accaaaggct cagacacaat ctccaagaat gggacacttt cttcggtcac ctcagcacga 960 

gctctgcggc cacccaaggc tgctcctcca agacctggca catttactcc cacacccagt 1020 

gtctctagcc aggccctgtc ctcaccaaga ctgcccaggg tagatgaacc cccacctcag 1080 

gcagtgtccc tgaccccagg tggggtttct tcttctgctc tgagccgcat gggtgctgtg 1140 

cctgtgatgg tgcctgcaca gagtcaggct gggtctcttg tg 1182 

<210> 189 

<211> 1182 

<212> DNA 

<213> Mus musculus 

<400> 189 

atgattcttc aggctggaac ccccgagacc agcttgctgc gggttttgtt cctgggactg 60 

agtacccttg ctgccttctc ccgagctcag atggagttgc acgtgccccc gggcctcaac 120 

aaattggaag cggtagaggg agaagaagtg gtgctccccg cctggtacac gatggcacgg 180 

gaggagtcgt ggtcccaccc ccgggaggtg cccatcctga tctggttctt ggaacaagaa 240 

gggaaggaac caaaccaggt gttgtcttac attaatggag tcatgacaaa taaacctgga 300 

acagccctgg tccactctat ctcttcacgg aatgtgtccc tgcgcctggg ggcactccag 360 

gagggagact ctgggactta ccgctgttct gtcaatgtgc agaatgatga aggcaaaagt 420 

ataggccaca gcatcaaaag catagagctc aaagtgctgg ttcctccagc tcctccatcc 480 

tgtagtttac agggtgtacc ctatgtcggg accaatgtga ccctgaactg caagtcccca 540 

aggagtaaac ctactgctca gtaccagtgg gagaggctgg ccccatcctc ccaggtcttc 600 

tttggaccag ccttagatgc tgttcgtgga tctttaaagc tcactaacct ttccattgcc 660 

atgtctggag tctatgtctg caaggctcaa aacagagtgg gctttgccaa gtgcaacgtg 720 

accttggacg tgatgacagg gtccaaggct gcagtggtcg ctggagcagt tgtgggcact 780 

tttgttgggt tggtgctgat agctgggctg gtcctgttgt accagcgccg gagcaagacc 840 

ttggaagagc tggccaatga tatcaaggaa gatgccattg ctccccggac cttgccttgg 900 

accaaaggct cagacacaat ctccaagaat gggacacttt cttcggtcac ctcagcacga 960 

gctctgcggc cacccaaggc tgctcctcca agacctggca catttactcc cacacccagt 1020 

gtctctagcc aggccctgtc ctcaccaaga ctgcccaggg tagatgaacc cccacctcag 1080 

gcagtgtccc tgaccccagg tggggtttct tcttctgttc tgagccgcat gggtgctgtg 1140 

cctgtgatgg tgcctgcaca gagtcaggct gggtctcttg tg 1182 

<210> 190 
<211> 735 
<212> DNA 
<213> Homo sapiens 



<400> 190 
atgcgtctgt 
tggacccttt 
tgcaggccct 
tcctgttcca 
atgtgccctg 
ggttcagtgc 
accgtggcca 
acctgctcct 
accacaacta 
tatcctggac 
gcaccctacc 



ttgtccgtcc 
cgctgctgct 
tcggtgaaga 
gccaatactg 
agccagagtc 
tgaagtatca 
tcggcctgac 
gctgctgtct 
ctaccgtggt 
caacatacca 
caacgcagta 



gtccgtccgt 
gttgttgcta 
caattcgatc 
ctgctctgac 
cagcagattt 
gtccagtctt 
cgtcttcgtg 
atataagatg 
tcacaccgct 
gggctaccat 
ccctccaccc 



cccgccatgg 
ctgccgtctc 
ccagagtcct 
gtgctgaaga 
tccgcccacc 
gacagtgaca 
gtgtttatcg 
tgctgccgcc 
taccctcagc 
cccatgcccc 
tacctggccc 



ctgcgccggc 
cgggtgccca 
gtcctgactt 
aaatccagtg 
cggagacacc 
acatgccagg 
ctaccatcat 
cacgacctgt 
ctcaacctgt 
cccagccagg 
agcccacagg 



gccctctccg 
tggcgagctg 
ctgttgtggc 
gaatgaggaa 
agaacagctg 
gttcggagcg 
tgtgtgcttt 
cgtgtccaac 
ggcccccagc 
aatgccagca 
gccaccagcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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tatcatgaga cgttggctgg agccagccag cctccataca acccggccta catggatccc 720 
ccaaaggcag ttccc 735 

<210> 191 
<211> 735 
<212> DNA 

<213> Homo sapiens 
<400> 191 

atgcgtctgt ttgtccgtcc gtccgtccgt cccgccatgg ctgcgccggc gccctctccg 60 

tggacccttt cgctgctgct gttgttgcta ctgccgtctc cgggtgccca tggcgagctg 12 0 

tgcaggccct tcggtgaaga caattcgatc ccagagtcct gtcctgactt ctgttgtggc 180 

tcctgttcca gccaatactg ctgctctgac gtgctgaaga aaatccagtg gaatgaggaa 240 

atgtgccctg agccagagtc cagcagattt tccgcccacc cggagacacc agaacagctg 300 

ggttcagcgc tgaagtatca gtccagtctt gacagtgaca acatgccagg gttcggagcg 360 

accgtggcca tcggcctgac cgtcttcgtg gtgtttatcg ctaccatcat tgtgtgcttt 420 

acctgctcct gctgctgtct atataagatg tgctgccgcc cacgacctgt cgtgtccaac 480 

accacaacta ctaccgcggt tcacaccgct taccctcagc ctcaacctgt ggcccccagc 540 

tatcctggac caacatacca gggctaccat cccatgcccc cccagccagg aatgccagca 600 

gcaccctacc caacgcagta ccctccaccc tacctggccc agcccacagg gccaccagcc 660 

tatcatgaga cgttggctgg agccagccag cctccataca acccggccta catggatccc 720 

ccaaaggcag ttccc 735 

<210> 192 
<211> 735 
<212> DNA 

<213> Homo sapiens 
<400> 192 

atgcgtctgt ttgtccgtcc gtccgtccgt cccgccatgg ctgcgccggc gccctctccg 60 

tggacccttt cgctgctgct gttgttgcta ctgccgtctc cgggtgccca tggcgagctg 120 

tgcaggccct tcggtgaaga caattcgatc ccagagtcct gtcctgactt ctgttgtggc 180 

tcctgttcca gccaatactg ctgctctgac gtgctgaaga aaatccagtg gaatgaggaa 240 

atgtgccctg agccagagtc cagcagattt tccgcccacc cggagacacc agaacagctg 300 

ggttcagcgc tgaagtatca gtccagtctt gacagtgaca acatgccagg gttcggagcg 360 

accgtggcca tcggcctgac cgtcttcgtg gtgtttatcg ctaccatcat tgtgtgcttt 420 

acctgctcct gctgctgtct atataagatg tgctgccgcc cacgacctgt cgtgtccaac 480 

accacaacta ctaccgtggt tcacaccgct taccctcagc ctcaacctgt ggcccccagc 540 

tatcctggac caacatacca gggctaccat cccatgcccc cccagccagg aatgccagca 600 

gtaccctacc caacgcagta ccctccaccc tacctggccc agcccacagg gccaccagcc 660 

tatcatgaga cgttggctgg agccagccag cctccataca acccggccta catggatccc 720 

ccaaaggcag ttccc 735 

<210> 193 
<211> 22 
<212> DNA . 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 193 

caaagtgagc tcatgctctc ac 22 

<210> 194 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<22 3> reverse primer 
<400> 194 

ctctggtctt gggcagaaat c 21 

<210> 195 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 195 

ggagtatcct tggtctactc c 21 

<210> 196 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 196 

gaaagtctgg aaggatggaa get 23 

<210> 197 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 197 

ggatgatggc taccagattg tc 22 

<210> 198 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 198 

ggaacattga gggttttgac tc 22 
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